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ABSTRACT 


Antimicrobial resistance is an urgent global health 
challenge in human and veterinary medicine. Wild 
animals are not directly exposed to clinically relevant 
antibiotics; however, antibacterial resistance in wild 
animals has been increasingly reported worldwide in 
parallel to the situation in human and veterinary 
medicine. This underlies the complexity of bacterial 
resistance in wild animals and the possible 
interspecies transmission between humans, 
domestic animals, the environment, and wildlife. This 
review summarizes the current data on expanded- 
spectrum -lactamase (ESBL), AmpC B-lactamase, 
carbapenemase, and colistin resistance genes in 
Enterobacteriaceae isolates of wildlife origin. The 
aim of this review is to better understand the 
important role of wild animals as reservoirs and 
vectors in the global dissemination of crucial clinical 
antibacterial resistance. In this regard, continued 
surveillance is urgently needed worldwide. 


Keywords: AmpC; ESBLs; IMP, mcr-1; NDM; Wild 
birds 


INTRODUCTION 


Over several decades, antimicrobial resistance has become a 
global clinical and public health threat against the effective 
treatment of common infections caused by resistant pathogens, 
resulting in treatment failure and increased mortality (WHO, 
2014). The development of bacterial resistance is a natural 
evolution of microorganisms, but the widespread use and 
misuse of antibacterial agents in humans and animals has 
accelerated this process (WHO, 2014). Furthermore, the 
increasing frequency of global travel and trade has also 
contributed to the rapid worldwide spread of antimicrobial 
resistance (Laxminarayan et al., 2013). Some resistant clones, 
such as Escherichia coli ST131, Klebsiella pneumoniae ST258 
and ST11, and methicillin-resistant Staphylococcus aureus 
(MRSA) USA 300, which are involved in the spread of 
resistance to crucially significant antibiotics in human medicine, 
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have been widely disseminated (Lee et al., 2016; Mathers et al., 
2015; Nimmo, 2012). Antimicrobial resistance is a complex and 
multifaceted problem involving humans, animals, and the 
environment. However, the role of wildlife in the emergence of 
antibacterial resistance might be underestimated. The first 
report of antibacterial resistance in wildlife revealed 
chloramphenicol resistance in E. coli isolates obtained from 
Japanese wild birds (Sato et al., 1978). Since then, the 
occurrence of resistant bacteria in wild animals has been 
increasingly reported within diverse animal species across 
different geographical areas. In addition, several important 
antimicrobial resistant pathogens, such as MRSA (Loncaric et 
al., 2013a; Porrero et al., 2014), vancomycin-resistant 
enterococci (Drobni et al., 2009; Sellin et al., 2000), Salmonella 
spp. (Lee et al., 2011a), Vibrio cholerae (Aberkane et al., 2015), 
and Campylobacter spp. (Weis et al., 2016), have been 
described in wild animals, highlighting the importance and 
complexity of wildlife, not normally exposed to antibiotics 
directly, in the transmission of resistant bacteria. 

This review gives a brief overview of the emergence and 
prevalence of expanded-spectrum B-lactamase (ESBL), AmpC 
B-lactamase, carbapenemase, and colistin resistance genes in 
Enterobacteriaceae strains from wild animals, all of which have 
significant public health impact. Furthermore, this review aims 
to better understand the role of wildlife in the transmission of 
Clinically significant antimicrobial resistance in Enterobacteriaceae. 


ESBL-PRODUCING 
WILDLIFE 


ENTEROBACTERIACEAE FROM 


The global dissemination of ESBL-producing Enterobacteriaceae 
in human clinics is an urgent problem that poses a serious 
challenge to the treatment of infectious diseases, particularly 
the worldwide emergence of CTX-M-15-producing ST131 E. 
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coli (Alghoribi et al., 2015; Blanco et al., 2013; Hansen et al., 
2014; Hussain et al., 2014; Mathers et al., 2015; Platell et al., 
2011; Sauget et al., 2016). ESBL-producing Enterobacteriaceae 
have also been increasingly reported in livestock, companion 
animals, and food (Aliyu et al., 2016; Braun et al., 2016; Ewers 
et al., 2010; Hordijk et al., 2013; Michael et al., 2016). The CTX- 
M-type B-lactamases are the most common ESBLs among 
Enterobacteriaceae isolates of human and veterinary origin 
worldwide (Hordijk et al., 2013; Liu et al., 2016a; Pietsch et al., 
2017; Wang et al., 2016; Wellington et al., 2013). 

Since the first report on ESBL-producing E. coli isolates from 
wild animals in Portugal in 2006 (Costa et al., 2006), ESBL- 
producing Enterobacteriaceae of wildlife origin have so far been 
reported in Europe, Africa, Asia, South America, North America, 
and Australia (Table 1). Although ESBLs have been found in 
various Enterobacteriaceae, most ESBL-producing bacterial 
pathogens in wild animals are E. coli, followed by K. 
pneumoniae (Table 1). To date, at least 80 wildlife species have 
been found to be carriers of ESBL-producing Enterobacteriaceae, 
most being wild birds (Table 1). Similar to that among isolates 
from human and veterinary medicine, the CTX-M family is the 
most prevalent type of ESBL-producing Enterobacteriaceae 
found in wild animals (Table 1). Both blactx-m-1 and blactx-m-15 
are commonly reported in wild animals and are the most 
prevalent ESBL genes, followed by blacTx-u-a4, blactx-m-32, 
blactx.m-9; blactx-m-3; blactx.-2; and blactx.m-22. Other ESBL 
genes, such as blactx.me7, blactxss, blactx.ms, blactx-m-24, 
blactx-m-25, blactx-m-28, blactx-m29, and blactx-m-124, have also 
been detected, though infrequently (Table 1). Significant 
geographical differences have been observed in the occurrence 
of CTX-M enzymes. As summarized in Table 1, CTX-M-15 is 
the only reported CTX-M-type B-lactamase in Africa to date, 
and is the most common CTX-M-type enzyme reported in 
Bangladesh. In Canada and the US, CTX-M-14 is dominant, 
followed by CTX-M-15. Diversity in CTX-M B-lactamases has 
been reported in European countries, with the predominance of 
CTX-M-1 and CTX-M-15. Interestingly, CTX-M-15 is also 
reported to be the most common CTX-M enzyme in Franklin’s 
gulls (Leucophaeus pipixcan) in northern Chile (Baez et al., 
2015), although CTX-M-1 was previously reported to be 
dominant in the same gull species in central Chile (Hernandez 
et al., 2013). Baez et al. (2015) hypothesized that, based on 
their migratory habits, Franklin’s gulls from the north acquired 
resistant CTX-M-15-producing ST131 and ST10 E. coli clones, 
which are highly prevalent in humans in the US and Canada but 
scarce in Chile. However, this hypothesis, though possible, 
needs further investigation. 

In addition to CTX-M enzymes, SHV and TEM enzymes have 
also been reported in wildlife, especially SHV-12 and TEM-52, 
which accords with that found in ESBL-producing isolates from 
humans, livestock, and companion animals (Table 1) (Blanco et 
al., 2013; Carattoli et al., 2005; Hordijk et al., 2013; Michael et 
al., 2016; Smet et al., 2010). For example, SHV-12 has been 
frequently detected in wildlife in Spain (Alcala et al., 2016; 
Gongalves et al., 2012), and is highly prevalent in ESBL- 
producing E. coli obtained from 8- to 16-month-old healthy 
children in northern Spain (Fernandez-Reyes et al., 2014) and 
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in raw poultry meat from southern Spain (Egea et al., 2012), as 
well as from hospitals (Blanco et al., 2013). Other SHV-type 
enzymes, such as SHV-102, SHV-1, SHV-2, and SHV-5, and 
TEM-type enzymes, such as TEM-19, TEM-40, TEM-176, and 
TEM-20, have also been sporadically reported in wild animals 
(Table 1). 

More than 170 different sequence types (STs) have been 
identified in ESBL-producing E. coli isolates of wildlife origin 
(Table 1). Among them, ST131 is the most commonly detected 
clone. The dominant ST131 clone identified in wild animals, 
which has been frequently described in humans, companion 
animals, food products, and the environment, is involved in the 
international dissemination of blactx.mis and blacrx-M-44 
(Alghoribi et al., 2015; Bogaerts et al., 2015; Ewers et al., 2010; 
Hu et al., 2013; Hussain et al., 2014; Kawamura et al., 2014; 
Kim et al., 2017; Mathers et al., 2015). Additionally, other STs 
described in wild animals, such as ST10, ST69, ST405, ST410, 
and ST648, have also been reported in various sources and are 
responsible for the intercontinental distribution of CTX-M 
(Fischer et al., 2014, 2017; Hansen et al., 2014; Hu et al., 2013; 
Liu et al., 2016a; Muller et al., 2016; Su et al., 2016; Wang et al., 
2016). However, some STs found in wildlife, such as ST1340, 
ST1646, ST2687, ST3018, and ST3056, have been identified 
as new types and have not yet been reported in human or 
veterinary isolates (Bonnedahl et al., 2010; Hasan et al., 2014; 
Jamborova et al., 2015). 

As for ESBL-producing K. pneumoniae, limited studies are 
currently available on the clonal group of K. pneumoniae 
from wildlife (Table 1). Loncaric et al. (2016) found an SHV- 
11-encoding K. pneumoniae strain from mouflon (Ovis 
orientalis musimon) in Austria belonging to the epidemic 
clone ST11, which is associated with carbapenemase (Hu et 
al., 2016; Kim et al., 2013; Lee et al., 2016; Voulgari et al., 
2016) and ESBL in humans worldwide (Hu et al., 2016; Lee 
et al., 2011b; Lu et al., 2016; Sennati et al., 2012), and 
previously described in companion animals and Eurasian 
beaver (Castor fiber) (Donati et al., 2014; Pilo et al., 2015). 
In Algeria, all 17 blactx-v-1s-bearing K. pneumoniae isolates 
found in wild boars and Barbary macaques belong to ST584, 
which has also been detected in silver gulls as carriers of 
carbapenemase IMP-4 in Australia (Dolejska et al., 2016) as 
well as in human in Brazil (http: //bigsdb.pasteur.fr/klebsiella/ 
klebsiella.html). 

Successful clones found in humans and domestic and wild 
animals indicate possible interspecies transmission of ESBL- 
producing isolates. However, horizontal transfer mediated by 
mobile elements, such as insertion sequences and plasmids, is 
also one of the main methods for ESBL dissemination 
worldwide (Carattoli, 2013; Partridge, 2015). Only limited 
(mostly European) studies are available on ESBL-encoding 
plasmids in wild animals (Table 1). For example, blactx-m-15 is 
reportedly associated with IncF plasmids (mostly multiple 
replicons containing IncFIA and IncFIB) and Incl1 (Guenther et 
al., 2010a; Loncaric et al., 2016; Poirel et al., 2012; Tausova et 
al., 2012; Veldman et al., 2013), which agrees with previous 
research involving CTX-M-15-producing Enterobacteriaceae 
obtained from the environment, healthy cattle, and humans 
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(Zurfluh et al., 2015). IncHI2 plasmids have also been reported 
as carriers of blactx-m-1s in humans (Harrois et al., 2014; Nilsen 
et al., 2013), companion animals (Haenni et al., 2016), pigs 
(Tamang et al., 2015), and wild birds (Veldman et al., 2013). 
Though rare, Incl2 plasmid has also been described with blactxm-15 
in the lesser black-backed gull from the Netherlands (Veldman 
et al., 2013) and identified in a chicken E. coli strain in China 
(Liu et al., 2015). In Enterobacteriaceae of human and 
veterinary origin, blactxm1 has been frequently found 
associated with IncN and Incl1 plasmids (Carattoli, 2009; 
Jakobsen et al., 2015; Madec et al., 2015). Interestingly, blactxm-1 
has been mainly located on Incl1 plasmids in wildlife in Europe 
as well (Literak et al., 2010a, b; Loncaric et al., 2013b; Veldman 
et al., 2013). However, IncN, IncF, and IncHI1 plasmids have 
also been reported as carriers of blactx.u-1 (Literak et al., 2010a; 
Loncaric et al., 2013b; Veldman et al., 2013). Furthermore, 
Incl1 plasmids are also carriers of other ESBL genes in wild 
animals, such as blactx-m-3, blasuv.12, and blatem.s2 (Poirel et al., 
2012; Veldman et al., 2013). Similarly, IncF plasmids are also 
reported to be associated with blactx-m-3, blactx-m9, blacTx-M-14, 
blactx.v-27; blactx.-32; and blasuv-12 in wildlife (Guenther et al., 
2010b; Poirel et al., 2012; Tausova et al., 2012; Veldman et al., 
2013). The narrow-host-range plasmid IncX has been found to 
carry several ESBL genes, namely blatem-135, blatemsap, blatems7e, 
and blasyy-12, in E. coli isolated from diverse sources in Australia, 
Czech Republic, Spain, and Poland (Dobiasova & Dolejska, 
2016). Notably, plasmid replicon typing was performed on 
ESBL-producing isolates with multiple plasmids in several 
studies, thus the replicons of ESBL-carrying plasmids could not 
be confirmed (Gongalves et al., 2012; Hernandez et al., 2013). 

Insertion sequences also play an important role in facilitating 
the spread of ESBL genes (Partridge, 2015). Though few 
studies are available on horizontal transfer mediated by 
insertion sequences, associations of insertion sequence ISEcp1 
and blactxm, including blacrx-v-4, blacrxm14, blacrxmas, blactx.-27, 
and blacrx-v-32, have been observed in E. coli from wild animals 
(Costa et al., 2006; Gongalves et al., 2012; Poeta et al., 2008; 
Radhouani et al., 2010; Tausova et al., 2012). 

In summary, the ESBL gene types identified in wild animals 
are the same as those in human and veterinary medicine. Thus, 
interspecies transmission mediated by successful pandemic 
ESBL-producing clones and plasmids in humans, domestic 
animals, and wildlife might occur. 


PLASMID-MEDIATED AmpC B-LACTAMASE-PRODUCING 
ENTEROBACTERIACEAE FROM WILDLIFE 


Plasmid-mediated AmpC B-lactamases among Enterobacteriaceae 
in human and veterinary medicine are of considerable global 
concern because they confer resistance to clinically important 
cephalosporin antibiotics and B-lactamase inhibitors (Jacoby, 
2009; Smet et al., 2010). CMY-2 is the most prevalent AmpC B- 
lactamase and has been globally disseminated among 
Enterobacteriaceae in humans, companion animals, food- 
producing animals, and retail meat (Bogaerts et al., 2015; 
Carmo et al., 2014; Hansen et al., 2016; Jacoby, 2009; Ma et al., 
2012; Smet et al., 2010; Vogt et al., 2014; Wu et al., 2015). As 
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shown in Table 2, AmpC B-lactamases have been reported in E. 
coli, K. pneumoniae, and Enterobacter cloacae isolates of 
wildlife origin in Europe, North America, and Asia, and 
particularly in central Europe, similar to ESBL-producing 
Enterobacteriaceae (Table 2). To date, 20 different wild animal 
species, mostly birds, have been identified as blacwy.2 carriers 
(Table 2). Like that in human and veterinary medicine, blacmy-2 
is the most commonly detected AmpC-type B-lactamase among 
wild animals, though its identification has been limited to E. coli 
isolates. 

The blacmy2 gene is mainly located on IncA/C and Incl1 
plasmids from Enterobacteriaceae isolates of human and 
veterinary origin (Bogaerts et al., 2015; Bortolaia et al., 2014; 
Carattoli, 2009; Guo et al., 2014; Sidjabat et al., 2014). Among 
wild animals, blacmy-2 has been reported to be mainly 
associated with Incl1 plasmids in E. coli from Dutch wild birds 
and wild seagulls in the US (Poirel et al., 2012; Veldman et al., 
2013). Other plasmid types, such as IncB/O, IncK, and IncF, 
have also been identified as carriers of blacmy.2 in wildlife (Poirel 
et al., 2012; Veldman et al., 2013), as previously reported in 
humans, companion animals, broiler chickens, and retail meat 
(Bortolaia et al., 2014; Hansen et al., 2016; Hiki et al., 2013; So 
et al., 2012; Vogt et al. 2014). Thus far, the IncA/C plasmid, a 
major carrier of blacmy2, has not yet been identified in wild 
animals. The absence of blacmy-2-bearing IncA/C plasmids 
could simply reflect the limited studies on the characterization of 
blacuy.2-Carrying plasmids in wildlife, or might indicate that 
blacmy.2-harbouring Incl1 plasmids are more successful among 
wildlife. 

In addition to CMY-2, DHA-type AmpC B-lactamase genes 
have also been detected in K. pneumoniae from mouflons (Ovis 
orientalis musimon) in Austria, E. coli from hill mynah (Gracula 
religiosa) in Saudi Arabia, and K. pneumoniae ST11 isolates 
from Eurasian beaver (Castor fiber) in Switzerland (Hassan & 
Shobrak, 2015; Loncaric et al., 2016; Pilo et al., 2015). FOX-5 
encoded by an IncA/C plasmid has been obtained from K. 
pneumoniae isolates in the US (Poirel et al., 2012). 
Furthermore, a novel variant of the ACT AmpC B-lactamase 
gene has been identified in an Enterobacter cloacae strain 
originating from glaucous gull (Larus hyperboreus) in Arctic 
Svalbard, Norway (Literak et al., 2014). 


CARBAPENEMASE-PRODUCING ENTEROBACTERIACEAE 
FROM WILDLIFE 


Carbapenemase-producing Enterobacteriaceae isolates pose 
an urgent public health threat. New Delhi metallo-B-lactamase 
(NDM), as one of the most widespread carbapenemases, has 
been increasingly reported in human clinics, foods, domestic 
animals, and the environment worldwide (Abdallah et al., 2015; 
Chandran et al., 2014; He et al., 2017; Kumarasamy et al., 
2010; Lee et al., 2016; Qin et al., 2014; Toleman et al., 2015; 
Yaici et al., 2016; Yong et al., 2009). 

The first reported carbapenemase-producing bacteria in wild 
animals were isolated from black kites (Milvus migrans) in 
Germany (Fischer et al., 2013). Among 184 cefotaxime- 
resistant Salmonella spp. isolates, only one Salmonella 
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Corvallis isolate belonging to ST1541 has shown reduced 
susceptibility to carbapenem, and carries the carbapenemase 
gene blanow-1 located on ~180 kb IncA/C conjugative plasmid 
pRH-1738 (Fischer et al., 2013). The broad-host-range IncA/C 
plasmids are among the most predominant plasmids associated 
with blanpm-1 in humans (Carattoli, 2013). Fischer et al. (2013) 
supposed that the blanpw-1-bearing Salmonella Corvallis isolate 
might have originated from non-European countries and was 
transferred to Germany through the black kite migratory route, 
since Salmonella Corvallis was prevalent in South-East Asia 
and was emerging in North Africa and Nigeria, rather than in 
European countries. The complete sequence of plasmid pRH- 
1738 further confirms this hypothesis. Plasmid pRH-1738 
exhibited high relatedness with plasmid pMR0211 obtained 
from human Providencia stuartii isolate in Afghanistan, but 
showed distinct differences from other sequenced NDM-1- 
IncA/Cz plasmids from Western countries (Villa et al., 2015). In 
addition, fosfomycin resistance gene fosA3, which has been 
rarely detected in Europe but is prevalent among CTX-M- 
encoding E. coli and K. pneumoniae isolates in Asia (i.e., China, 
Japan, and South Korea), has also been identified on NDM-1- 
producing plasmid pRH-1738. blanpm-1 transferred with fosA3 
on IncA/C plasmid has only been described in clinical E. coli 
and Citrobacter freundii isolates in China (Qin et al., 2014). 
Taken together, these findings suggest that the origin of this 
plasmid might be in the Asiatic region. 

Large-scale transmission of IMP-producing bacteria into 
wildlife was first reported in 2015. In total, 120 carbapenemase- 
producing Enterobacteriaceae of 10 species were obtained 
from silver gulls in Australia, mainly E. coli (n=85), carrying 
blawmp-a, blap-ss, or bla\mp-26 (Dolejska et al., 2016). The blaimp-a 
gene has been found in 116 isolates, and is the most commonly 
detected gene among carbapenemase-producing Enterobacteriaceae 
isolates in human clinics in Australia (Bell et al., 2016; Sidjabat 
et al., 2015). blaimp4 in gulls is carried by various conjugative 
plasmids, mostly IncHI2-N plasmid type, followed by IncA/C 
plasmids, as well as IncL/M and Incl1, and is associated with a 
class 1 integron-containing blaimp4-qacG-aacA4-catB3 array in 
most positive strains (Dolejska et al., 2016). The same array 
carried by IncA/C and IncL/M plasmids is also reportedly 
responsible for the dissemination of blaiwe-4 in clinical isolates in 
Australia (Espedido et al., 2008), and by the IncHl2 plasmid in 
Salmonella Typhimurium from a cat in Australia (Abraham et al., 
2016). Furthermore, 19 different STs have been detected in 
IMP-4-producing E. coli isolates, including five prevalent lineages 
(ST216, ST58, ST354, ST167, and ST224), in which ST58, 
ST354, and ST167 are clinically relevant clone lineages (Ben 
Sallem et al., 2015; Fernandez et al., 2014; Huang et al., 2016). 

Although carbapenem resistance reported in wild animals is 
rare, the emergence of NDM-1 and IMP carbapenemases in 
wild birds is of concern. 


COLISTIN RESISTANCE GENE 
ENTEROBACTERIACEAE FROM WILDLIFE 


mcr-1 IN 


Colistin is widely applied in food-producing animals and is 
currently used as the last resort for treating infections caused 


by multi-resistant gram-negative bacteria (Kaye et al., 2016). 
Since the first identification of the plasmid-mediated colistin 
resistance gene mcr-1 in China in 2015 (Liu et al., 2016b), it 
has been identified in Enterobacteriaceae isolates from food- 
producing animals, companion animals, food products, the 
environment, and humans worldwide (Anjum et al., 2016; 
Doumith et al., 2016; Hasman et al., 2015; McGann et al., 2016; 
Xavier et al., 2016; Zhang et al., 2016; Zurfluh et al., 2016). 

The role of wild birds as reservoirs and vectors for the global 
distribution of mcr-1 should be considered. Recently, mcr-7 was 
described in an E. coli strain isolated from European herring 
gull (Larus argentatus) feces collected from the Kaunas 
(Lithuania) city dump (Ruzauskas & Vaskeviciute, 2016). 
However, the emergence of mcr-1 in wildlife could be traced 
back to E. coli strains isolated in 2012 (Liakopoulos et al., 2016). 
Five extended-spectrum cephalosporin-resistant E. coli isolates 
obtained from kelp gulls in Ushuaia, Argentina in 2012 were 
found to carry mer-1 and blacrx-u-2 (n=1) and blactxm-14 (n=4) 
and exhibited elevated colistin MICs (4-8 mg/L). The mcr-1 
gene was located on a ~57 kb Incl2 plasmid without blactx.y in 
all five isolates. Incl2 plasmids, which have been detected in E. 
coli and Salmonella isolates from food, food-producing animals, 
and humans in China, Great Britain, the US, Venezuela, and 
Denmark, have been reported to be associated with the 
transmission of mcr-1 (Anjum et al., 2016; Delgado-Blas et al., 
2016; Doumith et al., 2016; Hasman et al., 2015; Meinersmann 
et al., 2016; Yang et al., 2016). Notably, four mer-1-carrying 
isolates, which belong to ST744, have been previously 
described in Denmark and carry the mcr-1-bearing Incl2 
plasmid (Hasman et al., 2015; Liakopoulos et al., 2016). 


CONCLUSIONS 


Clinically relevant resistance, such as ESBL, AmpC 
cephalosporinase, carbapenemase, and colistin resistance, has 
been detected in wild animals, particularly wild birds, from 
distinct geographical areas. Thus, wild animals could serve as 
important reservoirs of resistant bacteria. Although the origin of 
bacterial resistance genes in wild animals remains unclear, as 
wildlife are not exposed to antibiotics directly, contact with 
sewage or animal manure might be one possibility (Wellington 
et al., 2013). Additionally, the potential of wild animals as 
vectors of resistant bacteria or genetic determinants should not 
be underestimated. Wildlife, especially migratory birds with their 
instinctive mobility, can carry resistant bacteria over long 
distances, even between continents; thus, this might be a new 
transmission route and partly responsible for the global 
dissemination of bacterial resistance. Contamination of food or 
water by wildlife is recognized as an important risk factor for the 
transmission of antimicrobial resistance or pathogens to food 
animals and humans (Greig et al., 2015). 

Wild animals might play a vital role in the worldwide spread of 
Clinically relevant pathogens or resistance genes. Pandemic 
ESBL-producing E. coli clones or plasmids shared by humans, 
domestic animals, and wildlife further strengthen this hypothesis. 
Thus, continued surveillance of multi-resistant bacteria in wild 
animals is warranted. 
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ABSTRACT 


Food availability significantly affects an animals 
energy metabolism, and thus its phenotype, survival, 
and reproduction. Maternal and offspring responses 
to food conditions are critical for understanding 
population dynamics and life-history evolution of a 
species. In this study, we conducted food 
manipulation experiments in field enclosures to 
identify the effect of food restriction on female 
reproductive traits and postpartum body condition, 
as well as on hatchling phenotypes, in a lacertid 
viviparous lizard from the Inner Mongolian desert 
steppe of China. Females under low-food availability 
treatment (LFT) had poorer immune function and 
body condition compared with those under high-food 
availability treatment (HFT). The food availability 
treatments significantly affected the litter size and 
litter mass of the females, but not their gestation 
period in captivity or brood success, or the body size, 
sprint speed, and sex ratio of the neonates. Females 
from the LFT group had smaller litter sizes and, 
therefore, lower litter mass than those from the HFT 
group. These results suggest that female racerunners 
facing food restriction lay fewer offspring with 
unchanged body size and locomotor performance, 
and incur a cost in the form of poor postpartum body 
condition and immune function. The flexibility of 
maternal responses to variable food availability 
represents an important life strategy that could 
enhance the resistance of lizards to unpredictable 
environmental change. 


Keywords: Eremias multiocellata; Food availability; 
Hatchling; Lizard; Reproductive output 


INTRODUCTION 


In nature, the food available to an animal exhibits spatio- 
temporal variation, significantly affecting its energy metabolism 


Science Press 


and consequently its phenotype, survival, and reproduction 
(Acheampong et al., 2011; Douhard et al., 2014; Du, 2006; Hoy 
et al., 2016). Food availability not only affects maternal fitness- 
related traits (e.g. body size, growth, and reproduction), but 
might also induce phenotypic variations in offspring (Ballinger & 
Congdon, 1980; Hillesheim & Stearns, 1991; Jones et al., 2015; 
Vaissi & Sharifi, 2016). For example, food availability can lead 
to variations in maternal reproduction (Ballinger, 1977) and 
offspring growth and survival in lizards (Dunham, 1978; Warner 
et al., 2015). Therefore, maternal and offspring responses to 
food conditions are critical for understanding ecological and 
evolutionary processes, such as population dynamics and life- 
history evolution. 

Food availability can significantly affect female reproductive 
strategies, such as reproductive timing, investment, and output 
(offspring number and size) (Ballinger, 1977; Du, 2006; 
Ramirez-Pinilla, 2006). Two kinds of trade-offs regarding 
energy allocation are faced by a mother when the energy 
available to her is limited. First, a mother has to decide on 
energy allocation for multiple tasks, such as maintenance, 
growth, and reproduction, leading to important life-history trade- 
offs (e.g., maintenance vs. reproduction) (Hegemann et al., 
2013; Rollinson & Rowe, 2016); for instance, the tropical house 
wren (Troglodytes aedon) decreases parental reproductive 
investment (i.e. nestling feeding frequency), but does not alter 
self-maintenance (metabolic rate and body condition) when the 
cost of activity increases during reproduction (Tieleman et al., 
2008). Second, a mother needs to decide on the distribution of 
energy among offspring within a clutch, thus leading to a trade- 
off between clutch size and offspring size (Bleu et al., 2013; Du 
et al., 2005; Olsson et al., 2002). The “optimal egg size theory” 
assumes that one optimal egg size is appropriate under certain 
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maternal and environmental conditions to maximize the fitness 
of offspring (Einum & Fleming, 1999; Smith & Fretwell, 1974; 
Williams, 1966). Accordingly, a mother might give priority of 
energy allocation to offspring size rather than to clutch size 
when faced with food restriction, especially in those species 
whose fitness is determined primarily by egg size (Styrsky et al., 
2000). However, other studies have shown that offspring size is 
not always optimized, and can be highly condition-dependent in 
some species (Krist & Munclinger, 2015; Rollinson & Hutchings, 
2013), with the shift in size being a function of maternal 
reproductive investment (Caley et al. 2001). In such cases, a 
mother might change both offspring and clutch sizes in 
response to food restriction. Despite extensive studies 
regarding the effect of food availability on maternal reproductive 
traits (e.g. Bonnet et al., 2001; Du, 2006; Hogstedt, 1981; 
Kitaysky et al., 1999; Warner & Lovern, 2014; Warner et al., 
2015), how female reproduction responds to food availability 
might differ among species and deserves further investigation. 

In addition to maternal reproductive traits, food availability 
can also profoundly affect offspring phenotypes and fitness. 
Previous studies have shown that a variety of phenotypic traits 
of offspring (e.g., body size, locomotor performance, and 
growth rate) can be affected by maternal food availability (Du, 
2006; Hafer et al., 2011; Raveh et al., 2016; Warner et al., 
2015), likely through a maternal effect—a phenomenon in 
which environmental information passes through generations 
by means of plasticity rather than direct genetic transmission 
(Fox & Mousseau, 1998; Hsu et al., 2016); for example, low 
food availability compromises the snout-vent length (SVL) of 
offspring, as well as the mass, performance ability, and fat 
reserves of the viviparous lizard Pseudemoia entrecasteauxii 
(Itonaga et al., 2012b). Offspring sex is one of the most 
interesting phenotypes affected by maternal food availability. 
The “sex allocation hypothesis” suggests that females can 
manipulate the sex ratio of their offspring, contingent upon local 
conditions, to maximize offspring fitness (Rosenfeld & Roberts, 
2004; Trivers & Willard, 1973). Food availability can affect the 
offspring sex ratio in several avian species; for example, when 
provided with supplementary food, the kakapo (Strigops 
habroptilus) female produces an excess of males (Clout et al., 
2002). However, such studies are scarce for reptilian species 
(Wapstra & Warner, 2010; Warner et al., 2007). 

Viviparous lizards make an excellent model for studying 
maternal and offspring responses to food availability because 
many lizards experience fluctuating food resources (Meserve et 
al., 2016; Zhu et al., 2014), and viviparous species retain their 
eggs for a longer period than oviparous species do, with higher 
locomotor and thermoregulatory costs (Le Galliard et al., 2003; 
Shine, 2003). Moreover, most ectothermic vertebrates are 
considered capital breeders, in which reproduction is financed 
from stored energetic capital (Bonnet et al., 1998), which makes 
the direct causal relationship between food availability and 
reproduction harder to detect. Conversely, many viviparous 
reptiles are considered income breeders because they can use 
maternal nutrients (matrotrophy) to supplement or replace yolk 
nutrients (lecithotrophy) for embryonic development (Bonnet et 
al., 2001; Ramirez-Pinilla, 2006; Winne et al., 2006). As a result, 
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viviparous lizards may confront more severe trade-offs between 
reproduction and self-maintenance when experiencing limited 
resources during gestation, and thus provide a unique system 
to increase our understanding of the effects of food on life 
history strategies in animals. In this study, we conducted food 
manipulation experiments in field enclosures to identify the 
effect of food restriction on female reproductive traits (such as 
gestation period in captivity and litter size) and postpartum body 
condition, as well as hatchling phenotypes (body size and 
locomotor performance and immune response) in a lacertid 
viviparous lizard from the Inner Mongolian desert steppe of 
China. We aimed to address the following questions: (1) How 
does female reproductive investment respond to food restriction? 
(2) Does food restriction affect postpartum body condition of 
females? (3) How does maternal food restriction influence 
neonate phenotypes? 


MATERIALS AND METHODS 


Study species 

The multi-ocellated racerunner (Eremias multiocellata) is a 
small viviparous lizard that inhabits desert or semiarid areas, 
with the SVL of adults ranging from 58 to 73 mm. This species 
is the main lizard fauna in our field study site at the 
Shierliancheng Field Station, Institute of Grassland Research of 
the Chinese Academy of Agricultural Sciences (Zeng et al., 
2016), located in Ordos, Inner Mongolia, China (N40°12'17”, 
E111°07'43"; elevation 1 036 m). Thermal and hydric 
environments significantly affect the reproductive traits and 
offspring phenotypes in different populations of this species, 
with the offspring sex ratio biased towards males when gravid 
females are kept at high temperatures (Tang et al., 2012; Wang 
et al., 2016). 


Maternal food treatment in field enclosures 

Adult E. multiocellata (16 males and 48 females) were collected 
from our field study site and were housed in round enclosures 
(highxdiameter=40 cmx180 cm) built at the field site (one male 
and three females in each enclosure) from May 20 to May 30, 
2014, which is the beginning of the reproductive season for this 
species. All females had bitemarks on their belly, suggesting 
that they had mated in the field. The enclosures were covered 
with plastic nets to avoid predation by birds. 

On June 1, 2014, the SVL and body mass of all females were 
measured, after which each female was randomly assigned to a 
treatment group with either high (n=24) or low (n=24) prey 
availability. Lizards in the high-food availability treatment (HFT) 
group were fed every other day with 0.05 g of mealworms per 
gram mass of female per day (amounting to 120% of the 
average consumption of a gravid female, as measured prior to 
the experiment), whereas those in the low-food availability 
treatment (LFT) group were fed at the same intervals with 0.025 
g of mealworms per gram mass of female per day (amounting 
to 60% of the average consumption of a gravid female). Each 
treatment was replicated eight times (for a total of 16 
enclosures). Given that all viviparous amniotes have a placenta 
that transfers maternal nutrients to the embryo (Flemming et al., 


2003), viviparous E. multiocellata should be matrotrophic. The 
females of this species start to copulate in May, and give birth in 
July and August, with a gestation period of about 54 days (Tang 
et al., 2012). Our food treatments lasted 30 days (from June 1 
to June 30), covering the middle half of the gestation period. 


Female reproductive traits and phenotypes of neonates 
On July 1, 2014, females (16 females from HFT and 15 females 
from LFT) in the field enclosures were retrieved and transferred 
to the laboratory for measurement of reproductive traits. 
Females were maintained in small cages (longxwidexhigh= 
310x210x180 mm) with a substrate of sand and two small 
pieces of brick as shelter. The cages were exposed to the 
natural light regime of the field station, and a 60W incandescent 
light bulb was suspended 5 cm above each cage for 
thermoregulation from 0800 to 1200h. Food (mealworms and 
crickets dusted with additional vitamins and minerals) was 
provided daily ad libitum. Each cage contained two females and 
was checked once per day for neonates, and four times per day 
following first parturition. 

Immediately after a female produced a litter of neonates, the 
mother was measured and weighed. The SVL and body mass 
of neonates were then measured to 0.01 mm and 0.001 g 
accuracy after the absorption of the yolks one day later. Litter 
size was determined as the number of neonates and litter mass 
was calculated as the total mass of neonates produced by a 
female (Li et al., 2006; Ramirez-Bautista et al., 2000). The 
gestation period in captivity was calculated as the days 
between the initiation of treatment and female parturition, which 
did not include the gestation period in the field before the 
females were collected. To measure the locomotor performance, 
each neonate was made to run on a racetrack (80 cm long, 
marked at 20-cm intervals), within 2-3 days of birth, by 
stimulation with a soft paintbrush (Irschick & Losos, 1998). This 
procedure was repeated twice at 30+1 °C allowing a break of 1 
h between each test. For quantitating locomotor performance, 
the sprint speed was determined by averaging the fastest 
speeds for covering a distance of 20 cm in each race. The sex 
of neonates was identified by observing the preanal scales— 
males have large, square, regularly-distributed preanal scales, 
whereas females have small, round, and scattered preanal 
scales (Wang et al., 2016). 

A total of 12 and 14 females produced offspring from the LFT 
and HFT groups, respectively. Offspring from four clutches in 
the LFT group and one clutch in the HFT group were stillborn 
and excluded from further analysis of locomotor performance 
and offspring sex. The brood success of females was 
calculated as the number of females producing live neonates / 
total number of females. 


Cellular immune response of postpartum females 

Cellular immune response was assessed by administering an 
injection (20 uL) of 50 mg of phytohemagglutinin (PHA) in the 
right foot of postpartum E. multiocellata females. The thickness 
of the right and left feet were measured every 12 h from 0 to 36 
h following the PHA injection. The difference in foot thickness 
was considered as an index of immune response (Brown et al., 


2011; Vinkler et al., 2010). The largest difference in thickness 
(i.e., peak immune response) was found at 12 h. Thus, PHA- 
induced skin swelling at 12 h was used to identify the effect of 
food availability on the immune response of females. 


Statistical analysis 

All analyses were performed with SPSS Statistics software (ver. 
22; IBM Corp. 2014). Data were normalized by log- 
transformation when necessary. Maternal body condition was 
calculated as a residual score from the regression of body 
mass on SVL (Jakob et al., 1996). Treatment effects on 
maternal body condition, PHA response, reproductive output, and 
neonate traits were evaluated by one-way analysis of variance 
(ANOVA). Clutch means were calculated for neonate traits to 
avoid pseudo-replication. Pearson Chi-Square test was used for 
comparing between-treatment differences in brood success. 


RESULTS 


At the beginning of the experiment, both male and female body 
sizes did not differ between food treatments (Table 1). However, 
the postpartum body conditions were worse for females in the 
LFT group than for those in the HFT group (F1,24=7.928, 
P=0.010) (Figure 1A), although initial body condition did not 
differ between the two food treatment groups (F1.24=1.162, 
P=0.292). In addition, females from the LFT group had poorer 
immune function than those from the HFT group, as indicated 
by the lower PHA response of LFT group females (F1,23=7.214, 
P=0.014, Figure 1B). 

Food availability treatment did not affect brood success or 
gestation period in captivity of females, but did significantly 
affect both litter size and litter mass (Table 1). The females from 
the LFT group had smaller litter sizes and, therefore, lower litter 
mass compared with those for females from the HFT group 
(Table 1). Moreover, maternal food availability did not affect the 
body size (SVL and body mass), sprint speed, or sex ratio of 
the neonates (Table 1). 


DISCUSSION 


When facing food restriction, female racerunners lay fewer 
offspring with unchanged body size and locomotor performance, 
but at a cost in terms of poor postpartum body condition and 
immune function. These maternal and offspring responses to 
food restriction have interesting implications for our 
understanding of the reproductive strategies of lizards under 
temporally fluctuating food abundance in nature. 

A female must decide how to allocate limited resources to the 
processes of self-maintenance and reproduction (Du, 2006; 
Hegemann et al., 2013; Itonaga et al., 2012a). Obviously, 
female racerunners from the LFT group produced neonates 
with a similar body size to those produced by females from the 
HFT group, but at the cost of poor body condition and immune 
function. Understandably, lizards adopt a strategy of keeping 
reproductive investment relatively constant and sacrificing other 
requirements like those of immune function and energy storage 
under constrained food conditions, because maintaining a 
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Table 1 Reproductive traits of Eremias multiocellata under high- and low-food availability treatment 


Variable 


Treatment 





Low food 


High food 





Adult body sizes 

Male SVL (mm) 

Male mass (g) 

Female SVL (mm) 

Female mass (g) 
Reproductive parameters 
Brood success 

Gestation period in captivity (days) 
Litter size (n) 

Litter mass (g) 

Neonate traits 

Neonate SVL (mm) 
Neonate mass (g) 

Sprint speed (m/s) 


Sex ratio (male%) 


63.19+0.65 (n=8) 
6.9440.25 (n=8) 
66.08+0.38 (n=24) 
7.2440.16 (n=24) 


53.33% (n=15) 
41.5841.28 (n=12) 
2.6740.27 (n=12) 
1.5640.17 (n=12) 


30.15+0.33 (n=12) 
0.59+0.02 (n=12) 
0.85+0.13 (n=8) 
42.1048.50 (n=8) 


62.9440.65 (n=8) 
7.1840.25 (n=8) 
66.1640.38 (n=24) 
7.2840.16 (n=24) 


81.25% (n=16) 
42.0041.19 (n=14) 
3.7140.25 (n=14) 
2.16+0.16 (n=14) 


29.52+0.30 (n=14) 
0.5840.02 (n=14) 
0.9540.10 (n=13) 
51.10£11.30 (n=13) 


F4, 44=0.740, P=0.789 
F4, 14=0.435, P=0.520 
F4, 4g=0.020, P=0.889 
F4, 4g=0.041, P=0.841 


x’=1.631, P=0.202 

Fi, 24=0.057, P=0.814 
Fa, 24=7.907, P=0.010 
Fi, 24=6.650, P=0.016 


F4, 24=1.991, P=0.171 
F4, 24=0.008, P=0.930 
F4, 19=0.355, P=0.558 
F4, 19=0.397, P=0.536 


Values are expressed as means+SE. One-way ANOVA and Fisher's Exact Test were used to compare the between-treatment differences in 
reproductive traits. Brood success=number of females producing live neonates/total number of females. Neonate traits (except for sex ratio) were 


calculated as clutch means to avoid pseudo-replication. SVL, snout-vent length. 
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Figure 1 Effects of food availability treatment on female 
postpartum body condition (A) and immune function (B) 
Data are summarized as meanst+SE. Immune function represents 
cellular immune response assessed by phytohemagglutinin (PHA)- 
induced skin swelling within 12 h of PHA application. LFT, low-food 
availability treatment; HFT, high-food availability treatment. 
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competent immune system is a nutritionally demanding process 
that necessitates trade-off decisions with competing nutrient 
demands, such as those of growth, reproduction, and thermal 
regulation (Sheldon & Verhulst, 1996; Uller et al., 2006; 
Zamora-Camacho et al., 2016). This strategy of females giving 
priority to reproduction rather than to self-maintenance has also 
been reported in other reptiles and vertebrate species; for 
example, the body condition of female jacky dragons 
(Amphibolurus muricatus) fed a low-quality diet declined 
dramatically throughout the reproductive season, whereas their 
hatchlings were larger and in better body condition than those 
produced from females fed a high-quality diet (Warner et al., 
2007). Such a life strategy helps the reproduction and 
recruitment of populations, but may significantly weaken the 
immune response and, therefore, the survival of females via 
elevated risks of predation or starvation in their natural habitat 
(Iglesias-Carrasco et al., 2016; Neuman-Lee et al., 2015). 

The pre-breeding nutritional condition of females is closely 
related to the production of neonates in oviparous vertebrates, 
with reduced fecundity under conditions of food limitation (Du, 
2006; Donelson et al., 2008; Johnson et al., 2014; Lehman & 
Smith, 1988; Warner & Lovern, 2014). In response to 
decreased resources, the females of multiple-clutch species 
can reduce their reproductive output by decreasing litter size 
(Sun et al., 2002), offspring size (Abell, 1999), or reproductive 
frequency (Du, 2006). The viviparous racerunners in our study 
produced one litter per reproductive season, and decreased 
their litter size rather than offspring size in response to food 
restriction. This reproductive strategy reflects the highly critical 


reproductive investment for each hatchling, suggesting that an 
optimal neonate body size could exist independent of nutritional 
conditions. This is consistent with the optimal offspring size 
theory in which females reproducing in a given environment 
divide available resources into optimally-sized offspring 
(Sinervo & Licht, 1991; Smith & Fretwell, 1974). Nonetheless, 
further studies on the relationship between initial body size and 
fitness of neonates are needed to verify this hypothesis. At the 
same time, the reduction in clutch size under conditions of low 
food availability could be an effective strategy to improve 
offspring fitness. Fewer offspring means more abundant food 
per neonate in an environment with finite resources, which will 
reduce competition and maximize the survival and reproduction 
of descendants (Bartlett, 1988). 

The traditional sex allocation theory generates two conflicting 
predictions on how maternal nutrition affects the offspring sex 
ratio (Hamilton, 1967; Trivers & Willard, 1973). Females may 
produce higher numbers of male offspring under low-food 
conditions because males might have higher fitness or are 
more likely to disperse under stressful food conditions 
(Komdeur et al., 1997; Warner et al., 2007). Alternatively, the 
opposite pattern may occur, because producing more 
daughters under low-food conditions will gain greater fitness 
return given that offspring fitness is less dependent on body 
size (hence, reproductive energy input) in daughters than in 
sons (Krist & Munclinger, 2015; Trivers & Willard, 1973). In the 
current case, however, females did not adjust the sex ratio of 
their offspring, giving no support to either prediction above. 

Overall, female viviparous lizards may sacrifice their own 
health to produce high-quality offspring and maintain the sex 
ratio, which could maximize their reproductive success and 
individual fitness when experiencing low food availability. In 
addition to maternal and offspring responses to environmental 
factors like temperature and precipitation (Ma et al., 2014; 
Wang et al., 2016), the flexibility of maternal responses to 
variable food availability represents an important life strategy 
that might enhance the resistance of lizards to unpredictable 
environmental change. Understanding maternal and offspring 
responses to the combined impact of these biotic and abiotic 
factors is a considerable challenge and should be of great 
interest for future investigations. 
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ABSTRACT 


Viral vector transfection systems are among the 
simplest of biological agents with the ability to 
transfer genes into the central nervous system. In 
brain research, a series of powerful and novel gene 
editing technologies are based on these systems. 
Although many viral vectors are used in rodents, 
their full application has been limited in non-human 
primates. To identify viral vectors that can stably and 
effectively express exogenous genes within non- 
human primates, eleven commonly used recombinant 
adeno-associated viral and lentiviral vectors, each 
carrying a gene to express green or red fluorescence, 
were injected into the parietal cortex of four rhesus 
monkeys. The expression of fluorescent cells was 
used to quantify transfection efficiency. Histological 
results revealed that recombinant adeno-associated 
viral vectors, especially the serotype 2/9 coupled 
with the cytomegalovirus, human synapsin l, or 
Ca”*/calmodulin-dependent protein kinase Il 
promoters, and lentiviral vector coupled with the 
human ubiquitin C promoter, induced higher expression 
of fluorescent cells, representing high transfection 
efficiency. This is the first comparison of transfection 
efficiencies of different viral vectors carrying different 
promoters and serotypes in non-human primates 
(NHPs). These results can be used as an aid to 
select optimal vectors to transfer exogenous genes 
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into the central nervous system of non-human 
primates. 


Keywords: Recombinant adeno-associated virus; 
Lentivirus; Rhesus monkey; Central nervous system 


INTRODUCTION 


The ability to manipulate the expression of genes in neurons 
using viral vectors allows for the transfer of exogenous genes 
into the central nervous system (CNS), thus providing a new 
approach to probe the neurological and mechanistic bases for 
brain functions and disorders (Costantini et al., 2000; Davidson & 
Breakefield, 2003). Recently, a range of novel and 
sophisticated transgenic tools have been used in CNS studies, 
including CRISPR/Cas9 and optogenetics technologies (Fenno 
et al., 2011; Ran et al., 2013). These gene editing technologies 
require viral vector transfection systems to transfer exogenous 
genes into various types of neurons, glias, and other cell types. 
To date, nine viral vector species have been used in CNS 
studies (Davidson & Breakefield, 2003). Among them, adeno- 
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associated virus (AAV) and lentivirus vectors are the most 
advantageous due to their simplicity, low pathogenicity, and 
relative ease of production (Wong et al., 2006). 

AAV is a non-pathogenic member of the parvoviridae family 
belonging to the dependovirus genus, and induces weak host 
immune response (Bennett, 2003; Mingozzi & High, 2013; 
Nayak & Herzog, 2010; Xiao, 2003). This virus consists of a 
single-stranded genome and can deliver gene cassettes of 
approximately 4.7 kilobases (Carter & Samulski, 2000). AAV is 
a replication-defective virus and requires co-infection with a 
helper virus, such as adenovirus, to propagate, providing further 
safety for its application (Xiao, 2003). Moreover, AAV vectors 
can be purified to the highest titer in comparison to other 
viruses, which is advantageous because higher titers often 
result in higher transfection efficiencies (Davidson & 
Breakefield, 2003). In addition, AAV is a highly stable virus 
that is resistant to detergents, proteases, and organic solvents, 
and therefore is not affected by pH or temperature changes 
(Xiao, 2003). AAV also has many serotypes for transfecting 
different cell types (Burger et al., 2004; Schmidt et al., 2008; 
Xiao et al., 1999). For example, AAV1 is used to transfect 
skeletal muscle cells (Bish et al., 2008), AAV4 is used for 
transfecting ependymal cells (Davidson et al., 2000), and 
AAV8 is used for transfecting liver cells (Gao et al., 2002). 
Previous studies have shown that AAV1, 2, 5, 8, and 9 can 
transfect neurons and glia cells into various brain regions 
(Bartlett et al., 1998; Burger et al., 2004; Cearley & Wolfe, 
2006; Gao et al., 2002; Towne et al., 2010), with their 
transfection efficiencies compared in this study. 

It has been difficult to directly compare tropisms among 
various AAV serotypes because different Rep and Cap genes 
can lead to different tropisms (Burger et al., 2004; Taymans et 
al., 2007; Zincarelli et al., 2008). Therefore, to efficiently 
compare the transfection ability of certain serotypes in monkey 
brains, cross-packaged virions, such as AAV2/8 or AAV2/9, 
were utilized in this experiment (Rabinowitz et al., 2002). These 
virions were packaged with Rep genes from serotype 2 and 
Cap genes from another serotype. This allowed us to compare 
the effects of the Cap genes, while excluding the impacts of the 
Rep gene (Burger et al., 2004; Taymans et al., 2007; Zincarelli 
et al., 2008). As such, recombinant adeno-associated virus 
(rAAV) serotypes were chosen for this study, and included 
rAAV2/2, rAAV2/5, rAAV2/8, and rAAV2/9. 

Lentiviral vectors are derived from human immunodeficiency 
virus type 1 (HIV-1) and are often used to broaden the 
spectrum of transfection by containing a vesicular stomatitis 
virus G-protein (Cronin et al., 2005; Costantini et al., 2000; Lai 
& Brady, 2002; Naldini et al., 1996). Lentiviral vectors have a 
large genetic capacity (10 kb) and can integrate stably into the 
host cell genome of non-dividing cells such as neurons and in 
hematopoietic stem cells (Arhel et al., 2006; Lewis et al., 1992; 
Wiznerowicz & Trono, 2005). These features have helped 
achieve highly successful gene transfer because lentiviruses 
have the ability to transfect post-mitotic neurons in the CNS, 
which are generally considered refractory to infection (Cockrell & 
Kafri, 2007; Lewis et al., 1992; Miller et al., 1990; Wiznerowicz & 
Trono, 2005). 


Researchers have shown growing interest in the application 
of new gene editing technologies in non-human primates 
(NHPs) as the differences between rodents and humans in 
brain size, structure, and function, as well as cell type and 
neural circuitry (Jennings et al., 2016), make brain disorder 
research on rodents less effective and often not directly 
applicable to humans (Okano et al. 2012). NHPs are 
considered ideal experimental animals for human disease 
modeling due to their high genetic similarity (Kirik et al., 2002). 
Moreover, NHPs are like humans in overall brain structure and 
social group living, thus providing a better reflection of the 
occurrence and development of human brain disorders (Shultz 
et al., 2011). Although lentiviral and AAV vectors are the most 
attractive genetic manipulation tools in developing transgenic 
models, their application has been primarily restricted to 
rodents (Aschauer et al., 2013; Weinberg et al., 2013), with 
many mature transgenic rodent models used in fundamental 
research and clinical trials (Elder et al., 2010; Kaplitt et al., 1994; 
Orth & Tabrizi, 2003; Ridet et al., 2006; Yang et al., 2013). 
However, similar transfection systems have been rarely used in 
NHPs. Since 2014, when the CRISPR/Cas9 with lentiviral 
vector system was first applied to NHPs (Niu et al., 2014), only 
a few transgenic monkeys have been developed (Liu et al., 
2016;Niu et al., 2014; Niu et al., 2015; Zhang, 2017) and have 
not fully simulated the pathological symptoms of human 
disease, thereby limiting their widespread use. Different viruses, 
promoters, serotypes, and target brain regions affect the 
transfection efficacy of gene transfer (Scheyltjens et al., 2015). 
Therefore, another major hurdle in the application of AAV and 
lentiviruses to NHPs is the deficient knowledge on the 
identification of viral vectors that can lead to efficient and long- 
lasting expression in specific brain regions. Therefore, the full 
application of AAV and lentiviral vectors in NHPs remains a 
challenge (Watakabe et al., 2015). To date, most research has 
focused on the effects of different AAV serotypes in specific 
brain areas, such as the substantia nigra or corpus striatum 
(Markakis et al., 2010; Gerits et al., 2015; Watakabe et al., 
2015). In the present study, we compared commonly used AAV 
and lentiviral vectors with different promoters and serotypes in 
the parietal cortex of rhesus monkeys to determine the optimal 
vector to deliver transgenes into the neocortex of NHPs. Eight 
commonly used AAV vectors and three lentiviral vectors with 
different promoters and serotypes were used. 


MATERIALS AND METHODS 

Subjects 

Four male adult rhesus monkeys (Macaca mulatta) weighing 5— 
9 kg and ranging in age from 10 to 15 years (mean+SE) were 
used as study subjects. The monkeys were housed in individual 
cages under typical conditions (temperature 21+2 °C, 14/10 h 
light/dark cycle, light from 0700h to 2100h, humidity 60%) and 
fed with standard monkey chow and daily supplements of fruit 
(Yang et al., 2014). All animals were handled in accordance 
with the Kunming Institute of Zoology policies for use of 
laboratory animals under the National Care and Use of Animals 
guidelines approved by the Chinese National Animal Research 
Authority, in conformance with international guidelines for the 


Zoological Research 38(2): 88-95, 2017 89 


ethical use of animals. 


Viral vectors 

Details of the lentivirus and AAV viral vectors used in this study 
are listed in Table 1. Viruses ID1, ID2,ID4, ID10, ID11 were 
purchased from Obio Technology Co., Ltd. (Shanghai, China), 
viruses ID3 and ID7 were purchased from BrainVTA 
Technology Co., Ltd. (Wuhan, China), ID5 and ID8 from our lab 
and all other vectors were provided by partner laboratories (ID6 
from Dr. Li-Ping Wang at the Shenzhen Institute of Advanced 


Table 1 Details of the 11 viral vectors used 


Technology, Chinese Academy of Sciences, Shenzhen, China, 
and ID9 from Dr. Zi-Long Qiu at the Shanghai Institutes for 
Biological Sciences, Chinese Academy of Sciences, Shanghai, 
China). All viruses carried the enhanced green fluorescent 
protein (eGFP) or mCherry gene sequence. The viruses were 
diluted in sterile phosphate buffered saline (PBS) to match their 
titers. The titers for the AAVs were 3.2x10'*-4.3x10" vg/mL 
and for the lentiviruses were 4.6x10® TU/mL. Expressions of 
the eGFP or mCherry proteins by neurons were used to 
indicate transfection efficiency. 





Virus ID Exogenous gene Promoter Type Serotype 
ID1 eGFP CMV rAAV AAV2/9 
ID2 eGFP CMV rAAV AAV2/2 
ID3 mCherry hSyn rAAV AAV2/9 
ID4 eGFP hSyn rAAV AAV2/8 
ID5 eGFP CAG rAAV AAV2/8 
ID6 eGFP CaMKII rAAV AAV2/5 
ID7 mCherry CaMKII rAAV AAV2/9 
ID8 eGFP hSyn rAAV AAV2/5 
ID9 eGFP UbC Lentivirus None 
ID10 eGFP hSyn Lentivirus None 
ID11 eGFP CMV Lentivirus None 


Abbreviations: hybrid CMV-chicken B-actin (CAG), cytomegalovirus (CMV), human ubiquitin C (UbC), human synapsin | (hSyn), Ca?*/calmodulin- 
dependent protein kinase Il (CaMKII) (Alexopoulou et al., 2008; Gruh et al., 2008; Kiigler et al., 2003; Mayford et al., 1996; Schorpp et al., 1996; 


Wilhelm et al., 2011). 


Surgery 

Prior to surgery, all monkeys were anesthetized with atropine 
(0.03—0.05 mg/kg, i.m.), then injected with ketamine (10 mg/kg, 
i.m.) and pentobarbital (30 mg/kg, i.m.). All surgical procedures 
were conducted under strict aseptic conditions. As a 
prophylactic antibiotic treatment, cephalosporin was injected for 
three consecutive days after surgery (25 mg/kg/day, i.m., once 
a day). Monkeys were placed in a RWD stereotaxic frame 
(Product Model: 68902, Shenzhen, China) and two pieces of 
bone from the skull above the parietal cortex (~2 cmx1 cm each) 
were removed. The dura was carefully stripped to expose the 
brain parenchyma and the vectors were then infused through a 
31-gauge Hamilton syringe placed in a syringe pump (WPI 
Apparatus, Sarasota, USA) attached to a stereotaxic instrument. 
Vectors were infused bilaterally into the brain using the 
following stereotaxic coordinates: (Injection Site 1 and Site 2): 
flat skull-anterior-posterior +25.05 mm, medial-lateral +8 mm, 
and dorsal-ventral 3 mm from parenchyma surface, interaural=0; 
(Injection Site 3 and Site 4): anterior-posterior +10.05 mm, 
medial-lateral +8 mm, and dorsal-ventral 3 mm from 
parenchyma surface, interaural=0. A map of the injection 
sites is shown in Figure 1. The vectors (5 uL) were infused 
at a rate of 400 nL/min. Following infusion, the needle was 
left in place for 8 min before being slowly retracted from the 
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brain. After infusion, the dura was sutured and the bones 
were put back. 

The four monkeys were injected with one vector per injection 
site. Viruses (ID1, 2, and 4) were injected into Monkey 1 (Site 1, 
Site 2, and Site 3), viruses (ID5, 6, 8, and 9) were injected into 
Monkey 2, viruses (ID10 and 11) were injected into Monkey 3 
(Site 1 and Site 3), and viruses (ID3 and 7) were injected into 
Monkey 4(Site 1 and Site 3). Blank sites were injected with 
other viruses (data not shown). The distances between the 
injection sites were greater than 15 mm to prevent diffusion of 
different viruses into one another. 


Histology and microscopy 

Six weeks after the viral vectors were injected, the monkeys 
were anesthetized with pentobarbital and _ transcardially 
perfused with 1 000 mL of PBS and 500 mL of 4% 
paraformaldehyde. The brains were then removed from the 
skull and fixed with 4% paraformaldehyde for 5d. Afterwards, 
the brains were equilibrated in 20%—30% sucrose until they 
sank. The frozen brains were sliced (40 um) in a Leica Cryostat 
(CM1850 UV; Solms, Germany). The fluorescent signals were 
strong enough to be detected without enhancement by 
immunodetection, and were observed by inverted fluorescence 
microscopy (Leica DMI 6000B; Camera: Leica DFC 290; 
Optical master: Leica LTR6000, Wetzlar, Germany). 





Figure 1 Distribution map of the injection sites on the rhesus 
monkey brain 

The injection sites are referenced from “rhesus monkey brain in 
stereotaxic coordinates” (Paxinos et al., 2000). 


Statistical analysis 

For measurement of the eGFP or mCherry positive cells, a 
circle was drawn to cover the center of the positive cells. At the 
center of the circle a single optical section was captured using a 
10x objective lens (Leica DMI 6000B; Camera: Leica DFC 290; 
Optical master: Leica LTR6000, Wetzlar, Germany). A region of 
interest (ROI, about 1.1 mmx1.1mm) was selected from the 






image. Morphologically intact neurons were counted. 
Fluorescence-positive cells were averaged by counting four 
slices from each viral injection site. Statistical relationships were 
determined using GraphPad (version 5.01) with one-way 
analysis of variance with Bonferroni multiple comparison tests. 
The level of significance was set at P<0.05, and all data were 
presented as means+SE. 


RESULTS 


In the present study, eight AAV and three lentiviral vectors were 
tested in the rhesus monkey brain. Among these viral vectors, 
four AAV vectors and one lentiviral vector successfully 
transfected the brain. 

Of the eight AAV vectors, AAV2/9 (vector ID1) and 
AAV2/2 (vector ID2) carrying the CMV promoter, as well as 
AAV2/9 carrying the hSyn (vector ID3) and CaMKII (vector 
ID7) promoters, successfully transfected the brain (Figure 2). 
It is worth noting that AAV2/9 with the hSyn and CaMKII 
promoter was successfully transfected, whereas AAV2/5 
and AAV2/8 with the same promoters failed. Additionally, 
AAV2/9 with the CMV promoter was also successfully 
transfected. The number of transfected neurons and the 
cellular morphology, as judged by the fluorescence signals, 
were compared for AAV2/9 coupled with three different 
promoters (ID1, ID3, and ID7). The neurons transfected with 
AAV2/9 vectors (ID1, ID3, and ID7) displayed a strong 
fluorescence signal in distal and thin dendrites (Figure 2). 
Statistical analysis revealed that the numbers of fluorescent 
positive neurons transfected with AAV2/9 with different 
promoters were not significantly different from each other 
(Figure 3). Thus, we inferred that the AAV2/9 serotype might 
be an ideal candidate serotype, and could be coupled with 
many commonly used promoters, resulting in high 
transfection efficacy in NHPs. 


ID1 ID2 ID3 ID4 
IDS ID6 ID7 ID8 


Figure 2 Expression of eGFP (ID1, ID2, ID4, ID5, ID6 and ID8) or mCherry (ID3 and ID7) using different viral vectors injected into the 


parietal cortex of the rhesus monkeys 


Brains were collected at six weeks post-injection. Vectors successfully expressed eGFP (ID1 and ID2) or mCherry (ID3 and ID 7). Scale bar, 100 um. 
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Figure 3 Comparison of transfection efficiencies of different viral 
vectors in the parietal cortex of rhesus monkeys 

Transfection efficiency was quantified by the number of eGFP or 
mCherry positive cells. Brains were collected six weeks post-injection. 
There were no significant differences among AAV ID1 (Nslides=4), AAV 
ID2 (Nslides=4), AAV ID3 (Nslides=4), and AAV ID7 (Nslides=4) (all P>0.05). 
Data are presented as means+SE. 


Details of the AAV vectors are reported in Table 2. The 
eGFP or mCherry expression profiles of the AAV vectors are 
shown in Figure 2. 


Table 2 Details of AAV vectors, including promoters and serotypes, 
and their expression 





Virus ID Promoter Serotype Expressed 
ID1 CMV. AAV2/9 Yes 

ID2 CMV. AAV2/2 Yes 

ID3 hSyn AAV2/9 Yse 

ID4 hSyn AAV2/8 No 

ID5 CAG AAV2/8 No 

ID6 CaMKII AAV2/5 No 

ID7 CaMKII AAV2/9 Yes 

ID8 hSyn AAV2/5 No 


Of the tested lentiviral vectors, only one with the UbC 
promoter was successfully transfected into the monkey brain. 
This suggests that the UbC promoter might have a strong effect 
at enhancing lentivirus expression in NHPs. Details of the 
lentiviral vector transfections are reported in Table 3. The 
eGFP expression profiles of the three lentiviruses are shown 
in Figure 4. 


DISCUSSION 


NHPs are important experimental animals in the development 
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Table 3 Lentiviral vector expression in the rhesus monkey brain 
(vector ID7 successfully expressed eGFP in the monkeys) 





Virus ID Promoter Type Expressed 
ID9 UbC Lentiviruses Yes 
ID10 hSyn Lentiviruses No 
ID11 CMV Lentiviruses No 
ID9 ID10 ID11 





Figure 4 Expression of eGFP six weeks after intracranial injection 
of lentiviral vectors 

Lentiviral vectors expressing eGFP were injected into the parietal cortex 
of the rhesus monkeys. Brains were collected six weeks post-injection. 
Vector ID7 was successfully transfected. Scale bar, 100 um. 


of genetic animal models of human diseases. The potential 
application of viral technologies to develop genetic models 
makes monkeys ideal candidates for therapeutics research 
(Jennings et al., 2016; Samaranch et al., 2012). However, only 
a few studies have compared the transfection efficiencies of 
different viral vectors in NHPs. As yet, most research has 
focused on the comparison of different AAV serotypes (Davidoff 
et al., 2005; Dodiya et al., 2010; Gerits et al., 2015; Lawlor et al., 
2009; Watakabe et al., 2015), with no studies comparing the 
efficiency of AAV and lentivirus transfection with commonly 
used promoters and serotypes in NHPs. The goal of the current 
research was to identify the best candidates from the most 
commonly used AAV and lentiviral vectors to improve the 
success rate of NHP genetic models. Recombinant AAV and 
lentiviral vectors containing fluorescent protein genes were 
injected to the parietal cortex of four rhesus monkeys, with the 
expression of fluorescent cells then used to quantify 
transfection efficiency. We found that the CMV promoter within 
AAV2/2 and the UbC promoter within lentivirus resulted in 
higher expression of fluorescent cells. The AAV2/9 vector 
coupled with CMV, hSyn, or CaMKII promoters led to similar 
and high expression of fluorescent cells. 

Our results demonstrated that AAV2/9 exhibited higher 
transfection efficiency than that of AAV2/5 or AAV2/8 in the 
CNS of rhesus monkeys, although they carried the same 
promoters. Furthermore, AAV coupled with the CMV or hSyn 
promoters showed successful transfection, but the lentivirus 
with these two promoters failed. Considering that AAV and 
lentivirus belong to different virus genera, the production 
processes and titration methods were different. Therefore, the 
transfection efficacies between the AAV and lentivirus were not 
compared in the present study. 

Selecting a suitable promoter is a key factor in achieving 


successful transfection (Davidson & Breakefield, 2003). In this 
research, CMV, hSyn, CaMKlla, CAG, and UbC were 
evaluated as alternative promoters. The CMV promoter was 
found to have higher transcription activity in AAV. It is worth 
noting that although the CMV promoter led to higher 
transcription activities in neurons, it has also been found to 
transcribe genes in other cell types, such as glial cells (Dittgen 
et al., 2004; Miyoshi et al., 1998; Zufferey et al., 1997). 
Therefore, the application of the CMV promoter is limited to 
research that solely focuses on neurons. The hSyn and 
CaMKlla promoters are known to activate only neurons (Hioki 
et al., 2007), and had higher transcription activities with AAV2/9 
than with AAV2/5 or AAV2/8. In addition, AAV2/9 coupled with 
the CMV promoter also resulted in high expression, suggesting 
that AAV2/9 can be used in neuron-type specific or non-specific 
transfection, thus making it an ideal candidate vector in NHP 
brain research. Concerning the lentivirus vectors, only the UbC 
promoter resulted in successful transfection. The UbC promoter 
regulates the expression of transgenes in a wide range of cells, 
as suggested by the expression of ubiquitin proteins in all 
eukaryotic cells, and is also able to confer high protein 
expression in neurons, whereas other promoters, including 
CMV, exhibit reduced activity (Schorpp et al., 1996; Wilhelm et 
al., 2011). 

In conclusion, this study identified optimal viral vectors for 
gene transfer in NHP brains. Eight recombinant AAV and three 
lentiviral vectors, which are widely used in rodent brain 
research, were injected into the parietal cortex of rhesus 
monkeys, with the expression of fluorescent cells then used to 
quantify transfection efficiency. Six weeks after the injection, 
one lentiviral vector and four AAV vectors were found to have 
successfully induced the expression of fluorescent cells in the 
monkey brains, representing successful transfection. This study 
also evaluated the specific promoters and serotypes contained 
within the tested vectors and found that AAV2/2 coupled with 
the CMV promoter and lentivirus coupled with the UbC 
promotor resulted in successful transfection. Further analysis 
found that AAV2/9 coupled with different promoters led to high 
and stable transcription activities in the brains of rhesus 
monkeys, suggesting that AAV2/9-based vectors containing 
different promoters (CMV, hSyn, and CaMKIlo) might be ideal 
for NHP genetic modeling and brain research. 
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ABSTRACT 


Brain development and aging are associated with 
alterations in multiple epigenetic systems, 
including DNA methylation and demethylation 
patterns. Here, we observed that the levels of the 5- 
hydroxymethylcytosine (ShmC) ten-eleven translocation 
(TET) enzyme-mediated active DNA demethylation 
products were dynamically changed and involved in 
postnatal brain development and aging in tree 
shrews (Tupaia belangeri chinensis). The levels of 
5hmC in multiple anatomic structures showed a 
gradual increase throughout postnatal development, 
whereas a significant decrease in 5hmC was found 
in several brain regions in aged tree shrews, 
including in the prefrontal cortex and hippocampus, 
but not the cerebellum. Active changes in Tet mRNA 
levels indicated that TET2 and TET3 predominantly 
contributed to the changes in 5hmC levels. Our 
findings provide new insight into the dynamic 
changes in 5hmC levels in tree shrew brains during 
postnatal development and aging processes. 


Keywords: Tree shrew; DNA demethylation; 5- 
hydroxymethylcytosine; Brain development and aging 


INTRODUCTION 


Epigenetic systems emphasize the heritable changes in gene 
expression that do not involve coding sequence modifications, 
e.g., DNA methylation, histone modification and chromatin 
remodeling, and non-coding RNA regulation. These types of 
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changes are proposed to be responsible for controlling the 
expression and function of genes and have emerged as 
important mediators of development and aging (Abel & Zukin, 
2008; Agis-Balboa et al., 2013; Alagiakrishnan et al., 2012; 
Bakulski et al., 2012; Barbash & Soreq, 2012; Bihaqi et al., 
2012; Coppieters & Dragunow, 2011; Deaton & Bird, 2011). 
Complex disorders, such as cardiovascular disease, cancer, 
diabetes, and neuropsychiatric and neurodegenerative 
diseases (Kinney & Pradhan, 2013; Konsoula & Barile, 2012; 
Kudo et al., 2012; Kwok, 2010; Maekawa & Watanabe, 2007; 
Marques et al., 2011; Urdinguio et al., 2009), have multifactorial 
origins, depending not only on genetic but also on environmental 
factors. The most extensively studied neurological disorder 
regarding epigenetic changes is Rett syndrome. Patients with 
Rett syndrome exhibit neurodevelopmental defects associated 
with MeCP2 mutations, which encode methyl CpG binding 
protein 2 and bind to methylated DNA (Ballestar et al., 2000; 
Hansen et al., 2010). Other mental retardation disorders are 
also linked to the disruption of genes involved in epigenetic 
mechanisms, e.g., alpha thalassemia/mental retardation X- 
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linked syndrome, Rubinstein-Taybi syndrome, and Coffin-Lowry 
syndrome (Urdinguio et al., 2009). Aberrant DNA methylation 
followed by abnormal gene expression was first reported as an 
epigenetic hallmark in cancer (Haffner et al., 2011; Malzkorn et 
al., 2011). The cytosine base in situ to produce 5- 
methylcytosine (5mC) is catalyzed by DNA methyltransferases 
(Lian et al., 2012). It is estimated that 5mC accounts for 2%-8% 
of total cytosines in human genomic DNA and affects a broad 
range of biological functions, including gene expression, 
genome integrity maintenance, parental imprinting, X- 
chromosome inactivation, development regulation, aging, and 
cancer(Guo et al., 2011; Lian et al., 2012; Szwagierczak et al., 
2010). Ten-eleven translocation (TET) proteins, which are the 
enzymes necessary for demethylating 5mC, display enzymatic 
activity for the conversion of 5mC to 5-hydroxymethylcytosine 
(5hmC) (Dahl et al., 2011; Kudo et al., 2012). This oxidized form 
of 5mC is known as the “sixth base”. Its presence adds a layer 
of complexity to the epigenetic regulation of DNA methylation 
(Ito et al., 2010; Tahiliani et al., 2009) and it is found in a variety 
of mammalian cells, particularly in self-renewing and pluripotent 
stem cells and neuronal cells (Guo et al., 2011; Szwagierczak 
et al., 2010). Although the biological role of 5hmC is not fully 
understood at present, it has gained attention due to its cancer 
biomarker potential (Davis & Vaisvila, 2011). It is assumed that, 
like 5mC, 5hmC also plays an important role in switching genes 
on and off. 

It is important to illuminate the differences in 5ShmC 
expression in different tissues and cells (Almeida et al., 2012a, 
b). Of note, 5hmC is particularly abundant in the brain (Globisch 
et al., 2010; Kriaucionis & Heintz, 2009; Li & Liu, 2011), and 
accounts for approximately 40% of modified cytosine, 
suggesting a potential role in neuronal plasticity (Szulwach et 
al., 2011). Low levels of 5hmC are reportedly associated with 
brain tumor differentiation and anaplasia (Kraus et al., 2012; 
Lian et al., 2012). In the mouse cortex and cerebellum, 5hmC is 
enriched within genes and appears to promote gene 
transcription (Jin et al., 2011; Song et al., 2011). As an 
intermediate state of complete DNA demethylation, 5hmC 
affects functional demethylation by blocking transcriptional 
repressor binding (Guo et al., 2011; He et al., 2011; Ito et al., 
2011; Zhang et al., 2012). Synchronous neuronal activity 
promotes active DNA demethylation of plasticity-related genes 
in the mouse brain through TET-mediated formation of 5hmC 
(Guo et al., 2011). However, it is not clear whether such 
demethylation solely accounts for the enrichment of 5hmC in 
the brain. Recent research has reported that DNA methylation 
dynamics influence brain function and are altered in 
neurological disorders (Chia et al., 2011). Genome-wide 
mapping of 5hmC in the mouse hippocampus and cerebellum 
at different developmental stages indicate an increase with age 
and gene expression related enrichment in genes implicated in 
neurodegeneration (Szulwach et al., 2011). In addition, our lab 
recently revealed that selective changes in 5hmC-mediated 
epigenetic regulation promote Purkinje cell neurodegeneration 
in Ataxia-Telangiectasia disease and regulate DNA damage 
response (Jiang et al., 2015, 2017). 

The tree shrew (Tupaia belangeri chinensis) belongs to Order 


Scandentia and is similar to primates in many biological 
features (Fan et al., 2013; Rockland & Lund, 1982; Xu et al., 
2012). Moreover, its high brain-to-body mass ratio makes it a 
promising non-human primate animal model in brain and 
biomedical research (Cao et al., 2003; Li et al., 2017). In the 
present study, we sought to define dynamic changes in 5hmC 
levels in the tree shrew brain during postnatal brain 
development and aging, as well as its potential epigenetic 
regulation. During postnatal development, the levels of 5hmC 
showed a gradual increase in multiple anatomic structures of 
the tree shrew brain. In contrast, a significant loss of 5hmC 
was found in several selective regions of the aged tree shrew 
brains. Thus, dynamic changes in 5hmC levels suggest its 
potential epigenetic regulation in the process of brain 
development and aging. 


MATERIALS AND METHODS 


Animal use and care 

Wild-type tree shrews were obtained from the Kunming Primate 
Research Center, Kunming Institute of Zoology, Chinese 
Academy of Sciences. All experimental procedures and animal 
care and handling were performed per the protocols approved 
by the Institutional Animal Care and Use Committee of the 
Kunming Institute of Zoology, Chinese Academy of Sciences. 
For 5hmC immunohistochemistry, P10-, 3-month, 1-, 2-, 4-, and 
5-year-old age-matched groups of animals (three males and 
three females) were used; for dot-blot assay, 3-month, 1-, 2-, 
and 4-year-old age-matched groups of animals (three males 
and three females) were used; for Tet mRNA quantitative real- 
time PCR, P10-, 3-month, 1-, 2-, and 4-year-old age-matched 
groups of animals (three males and three females) were used. 


Tree shrew brain cryostat section preparation 

The tree shrews were anaesthetized with ketamine (0.2 mg/g, 
i.m.) and perfused with phosphate buffer saline (PBS; 137 
mmol/L NaCl, 2.7 mmol/L KCl, 10 mmol/L NazHPO.,, 2 mmol/L 
KH2PO;, pH 7.4) after complete anesthesia. After the blood was 
rinsed off from the body of animal, the perfusion buffer was 
changed to 4% paraformaldehyde (PFA) fixative solution for 20 
min. Immediately, the whole brain was dissected using scissors 
and fixed with 4% PFA overnight at 4 °C. The fixed brain tissue 
was placed in 10, 20, and 30% sucrose solution sequentially for 
dehydration until it sank, and was then completely immersed 
with embedding agent optimal cutting temperature compound 
(OCT, Ted Pella Inc., USA). Finally, 15 um cryostat brain 
sections were prepared at -25 °C and stored at -80 °C. 


Immunohistochemistry 

For 3,3'-diaminobenzidine (DAB)/bright field staining, all frozen 
brain sections were pretreated with 0.3% hydrogen peroxide in 
methanol for 30 min to inhibit endogenous peroxidase activity, 
then rinsed in Tris-buffered saline (TBS), and treated with 0.1 
mol/L citrate buffer in a microwave at sufficient power to keep 
the solution at 100 °C for 20 min. Sections were cooled in the 
same buffer at room temperature (RT) for 30 min and rinsed in 
TBS. Slides were incubated in 10% goat serum in PBS blocking 
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solution for1 h at RT, after which anti-ShmC antibody (Abcam, 
ab106918, 1:200 dilution) was applied to the sections, followed 
by incubation at 4 °C overnight. The sections were washed 
three times in TBS before applying the secondary antibody 
(Vector Laboratories) for 1 h at RT. Afterwards, sections were 
rinsed three times in TBS. Rinsed sections were then incubated 
in Vectastain ABC Elite reagent (Vector Laboratories) for 1 h 
and developed using diaminobenzidine, according to the 
manufacturer’s protocols. The sections were counterstained 
with hematoxylin, and finally mounted in Permount under a 
glass cover slip after dehydration. Control sections were 
subjected to identical staining procedures, except for the 
omission of the primary antibody. 


Dot-blot analysis 

Genomic DNA was isolated from brain tissue using a QIAamp 
DNA Mini Kit (Qiagene). Dot-blot assays were performed as 
described previously (Jiang et al., 2015). Briefly, 10 ug of 
sample DNA was diluted in Tris-EDTA (10 mmol/L, pH 8.0) 
buffer to 90 uL, with 2 uL of 5 mol/L NaOH buffer then added. 
After denaturing at 98 °C for 10 min, samples were immediately 
chilled on ice and neutralized with 10 uL of 6 mol/L (NHa)2Ac to 
a final DNA concentration at 100 ng/uL. The indicated amount 
of mixed DNA solution was spotted on a N+-Nylon membrane 
(Amersham Biosciences) and then crosslinked by UV-x-linker at 
0.5 Joule/cm?. Blots were incubated in blocking buffer (5% non- 
fat milk and 1% BSA in PBS-T) for 1 h at RT and then in 5hmC 
(Active Motif 39769) solution (1:10 000 in blocking buffer) 
overnight at 4 °C. Corresponding secondary Ab-HRP (1:5 000 
in 5% non-fat milk) was used and blots were detected with the 
ECL system following the manufacturers instructions. 
Methylene blue (0.02% in Na2Ac, pH 5.5) staining was also 
performed to measure total DNA loading. 


RNA isolation and real-time quantitative PCR (RT-qPCR) 
Total RNA was isolated from tree shrew brain tissue using Trizol 
reagent (Invitrogen) and cDNA was generated with a Fast 
Quant RT Kit (Vazyme). Real-time qPCR reactions were 
performed on an ABI PRISM 7700 Sequence Detection System 
(Applied Biosystems) using SYBR Green reagent (Invitrogen). 
Expression levels of target genes were analyzed using the 
comparative cycle threshold (Ct) method, where Ct is the cycle 
threshold number normalized to that of B-actin. The primers 
used in this study were: for Tet1, Tet1-F 5'-CGGAATCATC 
CTACACGCCT-3'/Tet1-R 5'-GGACAGTTCATTACCCTCCTTA- 
3'; for Tet2, Tet2-F 5'-ACCCCCTGAT TAATAACCCT-3'/Tet2-R 
5'-CACCTCCATAAACACCTGAC-3'; and for Tet3, Tet3-F 5'- 
CAAGGCTGAGAACCCACTCA-3'/Tet3-R 5'-CTTTCTCCACTA 
TTTGTTCGAC-3'. B-actin was applied as the standard for 
normalization by using B-actin-F 5'-TGCGTGACATCAAGGA 
GAAG-3'/B-actin-R: 5'-ACCT GACCATCAGGCAACTC-3'. 


Statistics 

All data were presented as means+SE of a minimum of three 
replicates. For most analyses, we evaluated statistical 
differences using the Student's t-test. For all analyses, P<0.05 
indicated statistical significance. 
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RESULTS 


Spatiotemporal-biased increase of 5hmC in tree shrew 
brains during postnatal development 

To address if postnatal brain development could be attributed to 
active DNA demethylation, we examined the 5hmC levels in the 
prefrontal cortex (PFC), parietal cortex (PC), occipital cortex 
(OC), hippocampus (HP), and cerebellum (CB) of P10, 3-month, 
and 1-year-old tree shrew brains using immunohistochemistry. 
Compared with P10 brains, the intensity of 5hmC in the PFC, 
PC, OC, and HP neurons showed substantial increases in the 
3-month and 1-year-old tree shrew brains, and peaked at 2- 
years old (Figure 1A,B). However, despite of high abundance of 
5hmC in CB neurons in 1-year-old animals, high populations of 
CB neurons from P10 and 3-month-old animals showed low 
intensities of 5hmC (Figure 1A,B). The high abundance of 
5hmC in adult brains (1- and 2-year-old tree shrews), but not 
developing brains (P10 and 3-month-old tree shrews), suggests 
that dynamic changes in 5hmC levels are involved in postnatal 
brain development. We further analyzed 5hmC-specific 
immunostaining data and revealed a significant increase in 
5hmC intensity from P10 to 1-year, with a 2.31+0.12-fold 
increase in the PFC (P<0.05), 2.26+0.13-fold increase in the 
PC (P<0.01), 2.1+0.16-fold increase in the OC (P<0.001), 
2.25+0.18-fold increase in the HP (P<0.01), and a 4.5+0.15-fold 
increase in the CB (P<0.001) (Figure 1B). 


Age-related decrease of 5hmC in tree shrew brains during 
aging 

Although 5hmC-mediated epigenetic regulation plays an 
important role in brain development, whether it is involved in 
brain aging remains unclear. To explore if changes in 5hmC 
levels diminished its dynamics during brain aging, we examined 
the abundance of 5hmC in the PFC, PC, OC, HP, and CB in 2-, 
4- and 5-year-old tree shrew brains. In contrast to the 
abundance of 5hmC in 2-year-old brains, the intensities of 
5hmC were substantially decreased in the PFC, PC, OC, and 
HP neurons of 4- and 5-year-old tree shrews (Figure 1A,B). In 
both the cerebral cortices and HP, we found selective loss of 
5hmC in most neurons in 4- and 5-year-old tree shrew brains. 
Nevertheless, decreased 5hmC levels were not significantly 
observed in CB neurons (Figure 1A,B). The decreased 
abundance of 5hmC in aged brains (4- and 5-years old), but not 
adult brains (2-years old), suggests that changes in 5hmC 
levels are involved in brain aging. We analyzed 5hmC-specific 
immunostaining and revealed a significant decrease in 5hmC 
intensities from 2-year-old to 4- and 5-year-old brains, with a 
1.75+0.16-fold decrease in the PFC (P<0.05), 1.36+0.14-fold 
decrease in the PC (P<0.05), 1.6+0.13-fold decrease in the OC 
(P<0.01), and 1.5+0.17-fold decrease in the HP (P<0.05), but a 
0.35+0.08-fold increase in the CB (P>0.05) (Figure 1B). 

To verify dynamic changes in 5hmC levels in tree shrew 
brains during postnatal brain development and aging, we 
determined the global changes in 5hmC levels through dot-blot 
assay. Genomic DNA from the PFC, HP, OC, PC, and CB of 3- 
month, 1-, 2-, and 4-year-old tree shrew brains was extracted 
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Figure 1 Dynamic changes in 5hmC levels in tree shrew brains during postnatal development and aging 
A: 15 um cryostat sections of P10-, 3-month-, 1-, 2-, 4-, and 5-year-old wild-type tree shrew brains were immunostained with 5hmC (brown). 
Representative images were taken of the prefrontal cortex (PFC), parietal cortex (PC), occipital cortex (OC), hippocampus (HP), and cerebellum (CB). 
Signals from 100-300 neurons from each anatomic structure were averaged and normalized to the highest intensity across the observation. Scale bar, 


200 um. B: Normalized intensity of 5hmC staining illustrated in panel A. *: P<0.05. Error bars represent SE (unpaired t-test, n=3 repetitions of the 
experiment). 


and sonicated for examination. As expected, the levels of 
5hmC increased from the P10 to the 2-year-old brains in five 
regions during postnatal brain development (Figure 2A,B). In 
contrast, the levels of 5hmC in the aged brains showed a 
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slight decrease in the PC, OC, and CB, but not in the PFC or 
HP (Figure 2A,B). This indicates that selective reduction in 
5hmC levels might play a distinct role in different brain 
compartments during aging. 
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Figure 2 Decreases in 5hmC levels in the prefrontal cortex (PFC) and hippocampus (HP) during aging confirmed by dot-blot assay 

A: Dot-blot analysis of changes in 5hmC levels in 3-month, 1-, 2- and 4-year-old wild-type tree shrew brains. Sample loadings were 3 uL, 1.5 uL, and 
0.75 uL per-sample. Images were taken of the prefrontal cortex (PFC), parietal cortex (PC), occipital cortex (OC), hippocampus (HP), and cerebellum 
(CB). B: Quantitative statistics of 5hmC illustrated in panel A. *: P<0.05. Error bars represent SE (unpaired t-test, n=3 repetitions of the experiment). 
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Differential patterns of Tet mRNA expression in tree 
shrew brains during development and aging 

The TET family enzymes, including TET1, TET2, and TET3, are 
responsible for oxidizing 5mC into 5hmC. To test if changes in 
5hmC levels were directly associated with levels of Tet MRNA 
expression in tree shrew brains during postnatal development 
and aging, we examined their expression patterns. Total RNA 
from the PFC, HP, OC, PC, and CB of P10-, 3-month, 1-, 2-, 
and 4-year-old tree shrew brains were extracted, followed by 
RT-qPCR. In the PFC, Tet3 mRNA expression increased from 
P10- to 2-years old (2.15+0.11-fold, P<0.05), and then 
decreased at 4-years old. This is consistent with the patterns of 
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changes in 5hmC levels described above. However, Tet? and 
Tet2 mRNA expressions exhibited little change during postnatal 
development in the PFC (Figure 3A). The mRNA levels of Tet3 
showed an exclusive increase in the HP (~2-fold from P10- to 
2-years old) (Figure 3B). In contrast, mRNA levels of Tet? and 
Tet2 in the HP significantly decreased during postnatal brain 
development (from P10- to 2-years old) (Figure 3B). 
Interestingly, the mRNA levels of all three Tet genes in the PC, 
and Tet2/Tet3 in the OC showed significant increases during 
postnatal development (Figure 3C,D). Lastly, while mRNA 
levels of Tet? and Tet2 were upregulated in the CB during 
postnatal development and aging, the level of Tet3 mRNA 
remained quite stable (Figure 3E). 
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Figure 3 Expression patterns of Tet genes in tree shrew brains during postnatal development and aging 

A-E: Relative mRNA levels of Tet1, Tet2, and Tet3 in the prefrontal cortex (PFC), parietal cortex (PC), occipital cortex (OC), hippocampus (HP), and 
cerebellum (CB) were determined by RT-qPCR. B-actin was used as a quantitative and qualitative control. All expression data were normalized to the 
expression of Tet? at P10. *: P<0.0. Error bars represent SE (unpaired t- test, n=3 repetitions of the experiment). 


DISCUSSION 


DNA methylation/demethylation-mediated epigenetic regulation 
is involved in the processes of brain development and aging 
(Hutnick et al., 2009; Wilson et al., 1987). Abnormal alterations 
in this regulatory system leads to changes in brain function 
(Jiang et al., 2015; Szulwach et al., 2011); however, the 
underlying mechanisms remain uncertain in many biological 
processes and disease states. Loss of DNA methylation can 
occur through passive or active demethylation (Tahiliani et al., 
2009). In active processes, the 5mC methyl group is removed 
by an enzymatic process. It is proposed that this mechanism 
requires the action of TET dioxygenases, which hydroxylate 
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5mC to ShmC so that it can be deaminated and subsequently 
repaired by the base excision repair system (Chaudhry & 
Omaruddin, 2012; Kriaucionis & Heintz, 2009). As a novel 
epigenetic hallmark, the levels of ShmC in genomic DNA vary 
significantly depending on cell type (Guo et al., 2011; 
Kriaucionis & Heintz, 2009). 

In the present study, we observed and assessed dynamic 
changes in 5hmC levels in the tree shrew brain during postnatal 
development and aging. We found that the levels of ShmC in 
the tree shrew brains increased in multiple anatomic structures 
during postnatal development, whereas a significant decrease 
in 5hmC abundance in aged brain neurons was found in the 
cerebral cortices and hippocampus, but not in the cerebellum. 
Previous research has reported that 5hmC levels increase in 


the neurons of the mouse hippocampus and cerebellum during 
postnatal brain development (Szulwach et al., 2011), similar to 
the change patterns of the tree shrew brain observed in this 
study. These findings suggest that the dynamic change in 
5hmC levels is an epigenetic hallmark and potential key factor 
in regulating the processes of brain development and aging. 
In addition, dynamic changes in 5hmC levels in different 
neural cell types are different, and its mediated epigenetic 
regulation during brain development could be different to its 
role in brain aging. 

Brain function is linked to two significant factors: (1) processing of 
normal, usually abundant, proteins, e.g., neurotransmitter 
receptors and ion channels; and (2) maturation and aging. Age 
is not disease-specific, but is a prerequisite in brain 
development and aging. Our data indicate that understanding 
abnormal alteration in 5hmC during brain aging will be a useful 
epigenetic basis to explore the consequences of aging. We 
revealed that the dynamic pattern of changes in 5hmC levels in 
tree shrew brains predicts a possible link between DNA 
demethylation and brain development and aging. This novel 
perspective significantly broadens the search for new 
explanations of the complex biology of brain development and 
aging in tree shrews. The present study will help deepen our 
understanding of whether abnormal changes in 5hmC levels 
are epigenetic hallmarks of age-related neurodevelopmental 
and neurodegenerative disorders, and how changes in 5hmC 
levels in specific types of neurons produce a broad array of 
physiological and neurological symptoms. 

We presented original data and research on the tree shrew 
brain, but recognize that our approach entails significant 
challenges. In the future, we will expand our observations in 
deciphering the mechanisms of the dynamic changes in 
5hmC levels and provide new information about how the 
5hmC-mediated regulatory system functions in brain 
development and aging. 
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ABSTRACT 


Cyclophilin D (referred to as HsCypD) was obtained 
from the freshwater pearl mussel (Hyriopsis 
schlegelii). The full-length cDNA was 2 671 bp, 
encoding a protein consisting of 367 amino acids. 
HsCypD was determined to be a hydrophilic 
intracellular protein with 10 phosphorylation sites and 
four tetratricopeptide repeat (TPR) domains, but no 
signal peptide. The core sequence region 
YKGCIFHRIIKDFMVQGG is highly conserved in 
vertebrates and invertebrates. Phylogenetic tree 
analysis indicated that CypD from all species had a 
common origin, and HsCypD had the closest 
phylogenetic relationship with CypD from Lottia 
gigantea. The constitutive mRNA expression levels 
of HsCypD exhibited tissue-specific patterns, with 
the highest level detected in the intestines, followed 
by the gonads, and the lowest expression found in 
the hemocytes. 


Keywords: Hyriopsis schlegelii, 
Sequence analysis 


Cyclophilin D; 


INTRODUCTION 


Cyclophilin is a type of intracellular receptor of cyclosporin A 
(CsA). It has a peptidyl-prolyl cis-trans isomerase (PPlase) 
region and can be combined with CsA (Feng & Xin, 2013). 
Widely found in vivo, cyclophilins have a conserved structure 
and biological function. Cyclophilin D (CypD) is a mitochondrial 
matrix protein, and plays a crucial role in protein folding, cell 
apoptosis, necrosis, and immunosuppression (Thomas et al., 
2012). The CypD protein protect cells from death that is 
induced by oxidative stress and mediated by mitochondria 
(Basso et al., 2005). It is a key factor in the regulation of the 
mitochondrial permeability transition pore (MPTP), which plays 
a role in the release of cytochrome C and other apoptotic 
factors from mitochondria during cell apoptosis (Forte & 
Bernardi, 2006). CypD may also interact with mitochondrial 
adenine nucleotide transporters (ANT, Halestrap et al., 1998; 


Science Press 


Hunter & Haworth, 1979) and promote “open” conformation of 
ANT (Hunter & Haworth, 1979). Moreover, CypD can suppress 
apoptosis when it is overexpressed (Li et al., 2004; Lin & 
Lechleiter, 2002; Schubert & Grimm, 2004). 

Cyclophilins from the freshwater pearl mussel (Hyriopsis 
schlegelii) (Wang et al., 2016) are related to cell growth and 
immunity (Luo et al., 2015; Xie et al., 2011). However, whether 
CypD from Hyriopsis schlegelii (HsCypD) has a conserved 
structure remains unclear and its predicted function has yet to 
be reported. Here, we describe the predicted cDNA sequence 
and protein structural features of HsCypD. 


MATERIALS AND METHODS 
Experimental animals 


Healthy four-year-old H. schlegelii individuals (n=15), with shell 
lengths averaging 150.0+10.4 mm, were obtained from the 
Fuzhou Hongmen Reservoir Exploitation Corporation, Jiangxi 
Province. They were kept in aerated freshwater at 23+2 °C for 
one week before the tissues were harvested. 


Total RNA extraction, cDNA synthesis, and cloning 

Total RNA extraction was performed using TRIzol Reagent 
(Invitrogen) per the manufacturer's protocols. After the 
evaluation of RNA quantity, purity, and integrity, RNA from the 
gonads was used to prepare cDNA with the SMART RACE Kit 
(Clontech, USA). A cDNA library for H. schlegelii was 
constructed using the SMART cDNA Library Construction Kit 
(Clontech, USA). The full-length cDNA of HsCypD was cloned 
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by RACE methods, with gene specific primers (Supplementary 
Table 1) based on the known EST sequence. 


Bioinformatics analysis 

The cDNA fragments of HsCypD were assembled into complete 
full-length cDNA. The open reading frame was examined using 
ORF Finder. 

Protein molecular weight, isoelectric point (pl), and amino 
acid composition were analyzed with the Compute pl/Mw 
function of the ExPASy-ProtParam tool (http://web.expasy.org/ 
protparam/). Protein hydrophobicity was analyzed using 
ExPASy-ProtScale  (http://web.expasy.org/protscale/). Protein 
subcellular localization was predicted with PSORT Il 
(http:/Awww.genscript.com/tools/psort). Signal peptides were 
predicted using SignalP 3.0 (http:/Avww.cbs.dtu.dk/services/ 
SignalP-3.0/). The protein folding model was predicted using 
Predict Protein from Columbia University (https://www. 
predictprotein.org/). The protein domains were predicted using 
the Conserved Domain Database (https: //www.ncbi.nim.nih. 
gov/Structure/cdd/wrpsb.cgi). The three-dimensional structures 
of the protein sequences were predicted using the ExPASy 
SWISS-MODEL program (http://swissmodel.expasy.org/). 

Alignment of the amino acid sequences of HsCypD, HsCypC, 
and HsCypH with other species was performed using ClustalW 
Multiple Alignment (http: //www.ebi.ac.uk/clustalw/). A 
phylogenetic tree was constructed by neighbor-joining (NJ) 
using MEGA 4.0 (Tamura et al., 2007) with 1 000 bootstrap 
replicates, based on amino acids alignment. 


mRNA expression analysis by quantitative real-time PCR 
Real-time PCR was applied to examine the mRNA levels of 
HsCypD in 10 tissues using primer pairs CypD-F and CypD-R 
(Supplementary Table 1). Each assay was performed in 
triplicate with B-actin as the internal reference. Real-time PCR 
was performed for each cDNA sample on a Mastercycle ep 
Realplex2 Real-Time Thermal Cycler (Eppendorf) with SYBR 
Premix Ex TaqTM (TaKaRa). The 2°" method (Schmittgen & 
Livak, 2008) was used to analyze expression levels. All data 
were relative mRNAs expressed as mean+SD (n=3), and 
subjected to Student's t-test. 


RESULTS 


cDNA sequence of HsCypD and predicted protein features 

The complete cDNA sequence of HsCypD was 2 671 bp in 
length, containing a 5'-untranslated region (UTR, 80 bp), 3'- 
UTR (1 487 bp), polyadenylation signal (AATAAA), and 
unstable signal (ATTTTTA). The open reading frame (ORF) was 
1 104 bp, encoding 367 amino acids (Figure 1). The sequence 
was deposited in GenBank under accession number KJ747387. 
The predicted protein of HsCypD had a predicted isoelectric 
point (pl) of 5.43. The largest portion of the HsCypD residues 
was hydrophobic (131 amino acids), followed by uncharged 
polar amino acids (103 amino acids) (Supplementary Table 2). 
The hydrophobicity score of HsCypD was highest (1.544) at the 
24" and 275" sites and lowest (2.856) at the 260" site 
(Supplementary Figure 1). Hydrophobicity of HsCypD was not 
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obvious on the atlas, but the strength of hydrophilicity was clear. 
From the 330" to 360" residues, there was a small area of 
hydrophilicity that appeared to be very dense (Supplementary 
Figure 1). Moreover, the grand average of hydropathicity 
(GRAVY) of HsCypD was -0.703, thus showing obvious 
hydrophilicity. The proportion of hydrophilic amino acids (about 
64%) was much larger than that of hydrophobic amino acids 
(about 36%) (Supplementary Table 2). Therefore, HsCypD was 
predicted to be a hydrophilic protein. 

Protein subcellular localization results showed that 69.6%, 
8.7%, and 4.3% of the HsCypD protein was distributed in the 
cytoplasm, cell nucleus, and cell membrane, respectively. The 
distribution of the protein inside the cell was higher than that 
outside, indicating an intracellular protein. No obvious signal 
peptide characteristics were observed at the N-terminal 
(Supplementary Figure 2A,B). 

Eight possible folding patterns of HsCypD were identified: a 
N-glycosylation site at 118-122; casein kinase Il (CKIl) 
phosphorylation sites at 78-82, 83-87, 199-203, and 257-261; 
N-myristoylation sites at 74-80, 177-183, 200-206, and 312— 
318; protein kinase C (PKC) phosphorylation sites at 99-102, 
159-162, and 204-207; a cyclophilin peptidyl-proline cis-trans 
isomerase signal area at 58-76; CAMP and CGMP dependent 
protein kinase phosphorylation sites at 237-241 and 267-271; 
a leucine zipper model at 277—299; and a tyrosine kinase 
phosphorylation site at 354-362 (Supplementary Table 3). The 
secondary structures observed not only included a-helices, but 
also B-pleated sheets, B-turns, and random coils. Among them, 
a-helices (H) accounted for 37.87%, B-turns (E) accounted for 
14.44%, and other structures (L) accounted for the remaining 
47.68%. Random coils and a-helices were distributed uniformly 
throughout the protein; however, the a-helices were more 
obvious at the C terminal. 

The HsCypD had four tetratricopeptide repeat (TPR) domains 
that contained TPR-1 and TPR-11, each having two hits. The 
final specific hit was for a cyclophilin ABH_like, cyclophilin A, B, 
and H-like cyclophilin-type peptidylprolyl cis-trans isomerase 
(PPlase) domain, representing an archetypal cytosolic 
cyclophilin similar to human cyclophilin A, B, and H. 

The predicted three-dimensional structure was mainly 
composed of three a-helices, eight B-strands, some B-turns, 
and random coils. It was barrel-shaped, and the top and bottom 
were a combination of loops and three a-helices, which 
connected with both ends of the B-strands. The a-helices were 
more obvious in the C terminal (Supplementary Figure 3). 





Phylogenetic relationship of HsCypD to homologs of 
other species 
The HsCypD was homologous with CypD from other species 
(Supplementary Table 4), and exhibited the highest homology 
(61%) with Lottia gigantea. Multiple alignments revealed that 
the signature sequence of the CypD family could be identified in 
HsCypD (YKGCIFHRIIKDFMVQGG), and that the residues 
involved in CsA binding and PPlase activities were well 
conserved (Figure 2). 

Two major branches of vertebrates and invertebrates were 
classified on the CypD phylogenetic tree. HsCypD was found 


on the same branch as Lottia gigantea. Homo sapiens and 
Gorilla gorilla belonged to the same branch, and were the most 
distant from HsCypD (Figure 3). Furthermore, the phylogenetic 
tree based on CypD, CypC, and CypH sequences showed that 
HsCypH and CypDs from most species were grouped together. 
In particular, the CypC family and most of the CypD family were 
grouped together on a large branch. CypD from Lottia gigantea 
belonged to a lineage near the CypH group and CypH from 
Oryctolagus cuniculus belonged to a lineage near the CypC 


group (Supplementary Figure 4). 


Tissue expression profile of HsCypD 

The constitutive mRNA expression level of HsCypD was 
examined in 10 different tissues, including hemocytes, gill, 
mantle, kidney, heart, intestine, hepatopancreas, adductor 
muscle, gonad, and foot. The highest expression of HsCypD 
was detected in the intestine, with remarkable tissue expression 
patterns (Figure 4). 
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ATGGCGTCTGTAGAAAATCCGCGTGTCTTCTTTGACGTAGAAATAGGTGGAGACAATGTT 
MA S V ERRER Hry ERC G D NA 
GGTCGTATTGTCTTTGAGTTGTTCAAAGACAAAGTTCCAAAGACGGCAGAGAATTTCCGA 
I V FEK D K V RER A RE E: 
GCTCTGTGTACAGGAGAGAAAGGAGAGGGAAAATTTGGCAAGCCATTACATTACAAAGGA 


Zine hn r EH haw 


TGCATTTTCCACAGGATTATCAAAGACTTCATGGTGCAGGGAGGAGATTTCACCAATGCT 
tt ®f###{FT tt ie 
GATGGCACTGGAGGAGAAAGCATCTATGGAGAAAAATTTGAGGATGAAGACTTCACGCAA 
D EGG S eos F Peeves D Be T Q 
AAACATGATGTCCCTGGCTTACTTAGTATGGCCAATGCTGGTCCAAACACAAATGGTTCC 
K gD V P ERST A ce P NR 
CAGTTCTTCATTACAACAGTTCCAACTCCACACCTAGATGGGAAACACGTGGTCTTTGGA 
Cia T T V P T P E G Ge ee 
AGGGTCCTGAAGGGCATGGATGTTGTTAGAACTCTTGAGAATGAGGAAGTCAAAAGTGAA 
RES LEBM D V V R TEN E EVKSE 
AAGCCTGTGAAGGAATGCAAAATCGTTGATTGTGGGGAAATTCCTCCAGGGGCTGATGAT 
K BBV K EF RK EV D) ESMAI: P P GAD OD 
GGTGTACCAGTGGACGATGGAACAGGTGACAAATATGCTGAAAATCCTGATGATTCTGGT 
G V P V DSC T RRIK Y À E fe D S G 
CTTGACTTTACTGATAAAAACAGTGTCAACAGTGTAATCAATGTTATTGATGAGATACGC 
L Do BE. T D KN OS V S V N V I DE IR 
AAGATAGGAAATAATTTGTTTAAAGACAAGGAAGTATTGAAGGCTAAGAAAAAGTATTCA 
K I G N L FESD K E V L KEN K KEE S 
AAGGCACTCAGATATGTTGAGAAATTGAAAAATGATTTGGAACTTGGTTCAGATGAAGAG 
K AL RYE V EK LKNODLELGS DEE 
GATGATGATGAGATAGATAAGAAACAGACATTGTCAATTTACCTTAATTTAGCTGCTTGC 
DDODETIODK K QTL S Y BEL A ACR 
AACATACAGCTACAGAAATATGATGAGGCTTTAAAACATTGTAACAAGGCTCTAGATATT 
NIQtL@Q kK Y D E RL K H Men K ABD I 
GATGAGAAGAATGTGAAAGCTTTGTTCAGACGAGGCCAGACCCATAATGCAATGAAGAAC 
D E K V ESL FHN R CHET H N AM K N 
TGGGAAGAGGCATTGGGAGATCTTCAGAGAGCAGCAGAACTAGAACCAAATGACAAGGGT 
WE ERSL G DRO RENA E LENDIK G 
ATAAGAAAGGAGAAAGTGAAAAAGGCCAGAGATGCTTACAAAACTGAAGAGAAAAAACTA 
R KESK V K K A ROD A Y K T E EK K L 
TATGCTAAAATGTTCTCAGCATGATAGTGCCATTGTGTCACTAATCATTTATTCCTCATG 
val A a S À * 
TATATGTATATGTGGTTAGTTTTTATATGTAAGCAACTGTGAATTTATTGCATTTCTCGT 
ACTGTCATTTATCATTATTTTTGTAACGTCATGATCATTGTTTACCAGCAATTTATTCAT 
TTATAGACATATATTATGGACCAGGTTTTCTGCTGCCACTATAATGAAGTACTTTGTCCA 
TATATATATATATACAAAAAATAACTTGTCTCTCAGTAGATAATTTCTGGAATGCCAAGA 
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1461 GCTGACTCTTCTACCTGACTCTTGCATTAGCAAATTGTAAATTGCTTGCCTTAATGGTAC 
1521 AAAGATGTCCAGATTAACATATCAGATGTAAGTTTTGTTCTGAGCCACCCTGAGACTTT C 
1581 CTTCCTGGTACAAGAATATCCAATCTGAAAAATTATTAGTAATTTTGTTTTAAACCACAT 
1641 TTGAGAACTACATTGATGTAGGAAATGGGTCAACTTCTAAGTTCTTTGCAAACAGTAGT T 
1701 GTAGGAACAACAAATAATCTGGTAATAAATTATTATTTTGTCTGAATTTTAGTGAAAATG 
1761 TGTGGTGTGTTTGGTGTAACTCGGCAGCCTGTTGAGTGTACATGCATTTCACCTATTTCC 
1821 TTTAATGATCCAAGGATTGGGGAAAATTTTATTTTGCTGAATAGCTTCTTAGGCATTTTT 
1881 CTATATTATGTTTTTGTTTGCAATCCGAAGAATACCTGTAAAGTCTCTGTCCAGTCCCAA 
1941 TTTACAGTCTTGTTAAATAATGCTTATTAATGTGTAAGAGTGCCTCAATGGCCTCTTGAT 
2001 CCAAACCTTTTGTCAGTCTTGCAAATCACCTCAAATTACTGTGTAAGGATTATTGTTAAC 
2061 AATACTACTAGCATGGAATGTTAGAAAGATGCAAGCTATTCCCAGTGAAAATATCGGCTT 
2121 TTGTGACAATATTCAAGTATGAAAACCATCATTAGCAAAAGCGTAAATATGATGGGATTG 
2181 ATCAGTACGTGCTAATAGCACTGATGTAGAGTTATCCAGGAAAGGAGTTTTGACTATGGG 
2241 AGTGTTTTGGAAAATGTCGGTAAATAACACAGCGAACTGTAACTTGATTGTGCATTCAGT 
2301 GATATTTTTTCTACCAGATAATATCTCGCTTTGCTCAGTTTGAATGTGATAAGTAAATGA 
2361 GAGCATTCTGGTGATAATTATATTTGGTCTTCTTAATATAAGAAGTTGCCAATACTGTCC 
2421 ATTCAAAATTTAGAACATTAATCGGTTGACAATTCTTTTATTTAGTAGGAAAAATGTTTA 
2481 GGTTTTTTACTGTAGAAACTGATAAGATATTTCTTGGGAATGTTCAGATCATTGTGGCAT 
2541 AAATGTGTATTGACACATTAAATCATTAAGTTGATGTAAATGTATGGAAAACGAATTTTT 
2601 ATGTAATGGTATGCTGTCCATTTTATAATAAAATTCTGAAAAAA AAAAAAAAAAAAAAAA 
2661 AAAAAAAAAAA 


Figure 1 cDNA and deduced amino acid sequence of HsCypD 


*: Stop codon. Initiation codon (ATG), stop codon (TGA), unstable signal ATTTTTA, poly(A), and poly adenosine signal (AATAAA) are in bold. Conserved 
amino acid residues are shadowed. Framed area is proline cis-trans isomerase signal area. 
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Figure 2 Identification of a highly-conserved region across CypD amino acid sequences and their homologs 
Sequences were aligned using ClustalW alignment. (*) single fully-conserved residue; (: ) conservation of strong group; (.) conservation of weak group. 
Boxes show the well-conserved amino acid regions (18 aa residues) of all analyzed CypDs. The sequences used in the analyses are listed in 


Supplementary Table 4. 


DISCUSSION 
HsCypD was identified and characterized in the freshwater 


pearl mussel Hyriopsis schlegelii. It comprised a cyclophilin- 
type peptidyl-prolyl cis-trans isomerase (PPlase) region, four 
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significant TPR domains, and specific tertiary structure, as 
shared by other Cyps families (Blackburn et al., 2015; Ottiger et 
al., 1997; Wang et al., 2009). The PPlase region is highly- 
conserved in cyclophilins in vertebrates and invertebrates 
(Wang et al., 2009). The TPR domain connects to the single N- 
terminal cyclophilin domain of Cyp40 by a 30-amino acid linker 
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Figure 3 Phylogenetic tree (neighbor-joining) of CypD sequences including Hyriopsis schlegelii and 16 other species constructed using 


MEGA 4.0 


The values at the nodes indicate the percentage of trees in which this grouping occurred after bootstrapping (1 000 replicates; shown only when >60%). 


The sequences used in the analyses are listed in Supplementary Table 4. 
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Figure 4 HsCypD mRNA expression levels in different tissues of H. schlegelii 
HsCypD transcript levels in hemocytes, gill, mantle, kidney, heart, intestine, hepatopancreas, adductor muscle, gonad, and foot were normalized to that 
of hemocytes. Three individuals were used in each experiment for tissue collection and the experiment was repeated three times. Vertical bars 


represent mean+=SD (n=3). 


(Taylor et al., 2001). Similar to the tertiary structure of human 
CypA and CypB (Mikol et al., 1994; Ottiger et al., 1997), 
HsCypD exhibited a right-handed barrel structure formed by 
eight B-strands. The top and bottom of this structure combined 
loops and three a-helices connected with both ends of the B- 
strands. HsCypD showed high conservativeness in these 


domains and tertiary structure. 

The phylogenetic tree confirmed that HsCypD was more 
distantly related to CypDs from vertebrates than from 
invertebrates, similar to the evolutionary structure of CypA from 
V. philippinarum (Chen et al., 2011). Generally, the same types 
of cyclophilins (e.g., CypA and CypD) but isolated from different 
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species are more closely related to each other than different 
types of cyclophilins from the same species (Lee et al., 2002), 
although there are exceptions (Chen et al., 2011). In this study, 
HsCypH belonged to a lineage near the CypD group on the 
phylogenetic tree, with CypD from L. gigantea grouped with 
CypH and CypH from O. cuniculus grouped with CypC 
(Supplementary Figure 4). These findings demonstrate the 
close phylogenetic relationship of cyclophilins and suggest that 
this family likely has a common origin and is highly conserved. 

N-myristoylation is a lipid anchor modification of some 
proteins targeting them to membrane locations, thus 
transforming the function of the modified proteins, and plays a 
significant role in many cellular pathways, such as apoptosis, 
signal transduction, and alternative extracellular protein export 
(Borgese et al., 1996; Maurer-Stroh et al., 2002). PKC is an 
important neurotransmitter in intracellular signal transduction 
and participates in transmembrane signaling (Nishizuka, 1984). 
Another protein kinase, CKII, a highly conserved 
serine/threonine kinase of eukaryotic cells, is responsible for 
responding to growth factors (Marais et al., 1992). Tyrosine 
kinase is a key molecule in signal transduction and growth 
control (Cheng et al., 1993). The TPR domain can bind 
competitively to Hsp90 or Hsp70 and thus serve as co- 
chaperones (Young et al., 1998). The predicted HsCypD 
possessed these binding sites and domains. Thus, we 
speculated that HsCypD might have the ability to anchor to 
membranes, and might be involved in specific transfer 
processes of signal transduction and growth of cells, as well as 
performing as a chaperone. 

Cyps are widely distributed in various tissues (Danielson et 
al., 1988; Qiu et al., 2009). The high expression of Cyps in 
tissues is related to certain functional mechanisms (Qiu et al., 
2009; Watashi et al., 2005). In this study, the highest mRNA 
expression level of HsCypD was detected in the intestine. We 
speculated that HsCypD was very active in the intestine and 
might be involved in specific transfer processes of signal 
transduction and cytoprotection (Hausenloy et al. 2012; 
Tavecchio et al., 2013). Further systematic research is currently 
underway to characterize the functions and regulatory 
mechanisms of HsCypD. 
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ABSTRACT 


A computer software package called ‘FasParser’ 
was developed for manipulating sequence data. It 
can be used on personal computers to perform 
series of analyses, including counting and viewing 
differences between two sequences at both DNA 
and codon levels, identifying overlapping regions 
between two alignments, sorting of sequences 
according to their IDs or lengths, concatenating 
sequences of multiple loci for a particular set of 
samples, translating nucleotide sequences to amino 
acids, and constructing alignments in several 
different formats, as well as some extracting and 
filtrating of data for a particular FASTA file. Majority 
of these functions can be run in a batch mode, which 
is very useful for analyzing large data sets. This 
package can be used by a broad audience, and is 
designed for researchers that do not have 
programming experience in sequence analyses. The 
GUI version of FasParser can be downloaded from 


https://github.com/Sun-Yanbo/FasParser, free of charge. 


Keywords: FasParser; Batch processing; Sequence 
comparison; Extraction and filtration 


INTRODUCTION 


Recent developments in sequencing technology function to 
generate a vast amount of DNA and RNA sequence data. 
Analyses based on these sequences are one of the most important 
means of assessing their potential for biological inference. The 
amount of available sequence data has made their manipulation 
tricky, especially for researchers without programming experience. 
Hence, the development of user-friendly software facilitates 
research using batch modes for sequence extraction, filtration, 
translation and conversions of file formats. 

The program package MEGA (Kumar et al., 1994), which 
was developed decades ago, has achieved worldwide usage. 
Although it has manipulation functions, such as sequence 
viewing and format conversion, it focused mainly on various 
statistical analyses of molecular evolution. Many sequence 
manipulations still require manual work or the use of other tools 


110 Science Press 


(i.e., Microsoft Office Excel). Examples include the 
concatenation of loci from multiple sequence files, the 
extraction of some gene sequences from a whole genome, and 
the filtering of very short sequences in an alignment. Another 
package, BioEdit (Hall, 1999), can handle most simple 
sequence editing and manipulation functions. However, it 
inefficiently handles batch processing and can only deal with 
one alignment file at a time. 

Herein, | provide the new program package ‘FasParser’ for 
manipulating sequence files. It has a user-friendly GUI and 
batch processing modes, which allows users to handle multiple 
sequence files in a simple way. Presently, the package has 
seven main programs/functions (Figure 1): (1) counting and 
viewing the differences between two sequences at the DNA and 
codon levels; (2) identifying overlapping columns of two 
alignments of a same gene; (3) sorting sequences according to 
ID, sequence length, or ID list provided by user; (4) concatenating 
sequences for a particular set of samples from multiple 
sequence files; (5) batch-translating protein —coding nucleic 
acid sequences into amino acids; (6) constructing alignments 
with different formats; and (7) extracting and filtering sequences 
according to ID or sequence length. FasParser is a standalone 
application that has been compiled and tested on Windows 
7/10 operating systems. Only available computer memory limits 
the size of data to be analyzed. 


BATCH PROCESSING 

This new package can batch process several commonly used 
procedures including merging sequences, translating, aligning 
and converting formats. For merging, it can obtain a “super 
sequence” by concatenating all the loci sequences for a 
particular set of samples. This is useful for phylogenetic 
inference. The translation program can obtain the amino acid 
sequences according to multiple genetic codes. In addition to 
the batch processing, it can also read single FASTA file or 
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Figure 1 Overview of the functions provided by FasParser 


single DNA sequence (manual mode), thus providing a simple 
way to get the amino acid sequences. Alignment construction is 
one of the most important manipulations of sequences and the 
program can make use of three popular aligners for it: MUSCLE 
(Edgar, 2004), MAFFT (Katoh et al., 2002), and PRANK 
(LOytynoja & Goldman, 2005). The first two programs can 
generate final alignments quickly and automatically recognize 
the type of sequence (DNA or amino acid). Although PRANK is 
slower than the others, it produces more accurate results 
(Jordan & Goldman, 2012) and can directly obtain final 
alignments at the codon-level. In addition, FasParser can 
convert alignments to different formats, for example from 
FASTA to PHYLIP, PAML, or NEXUS. Batch processing of 


Raw alignments 








~4--hcECTGCGGCAGCAACAI aa 
AGCCATECTGCGGCAGCAACA} eium 
AGCCATECTGCGGCAGCAACAI—>|] column 
AGCCATECTGCGGCAGCAACA} Column 
AGCCATECTGCGGCAGCAACA} iret 
ad--}-Ectececcaccaaca} ei 
AGCCATECTGCGGCAGCAACA} aa 
AGCCATECTGCGGCAGCAACAL—> 

AGCCATECTGCGGCAGCAACA} Column 
AGCCATECTGCGGCAGCAACA} i 


Figure 2 Algorithm used to compare different alignments 
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these functions only needs a directory containing all the 
sequence files to be analyzed. 


SEQUENCE 
IDENTIFICATION 
After constructing an alignment, it is often desirable to visualize 
the mutations or substitutions between two sequences, and/or 
identify overlapping regions generated by different aligners for 
the same gene. The programs “Cmp-2Seq” and “Cmp-2Align” 
address these issues. Cmp-2Seq counts and displays 
differences between two sequences at the levels of nucleotides 
and codons. Under the codon level, the program estimates the 
total number of sites with synonymous (S) and non-synonymous 
(N) substitutions for the first sequence and then calculates the 
number of synonymous and non-synonymous substitutions 
between the two sequences according to the NG86 method of 
Nei & Gojobori (1986). This function is useful in analyses, such as 
cancer genomic studies that focus on understanding the selective 
pressures following cell proliferation (Liu et al., 2012). 

Cmp-2Align identifies overlapping regions between two 
alignments using a simple but rigorous algorithm (Figure 2). 
Briefly, for each base of an alignment column, the program 
calculates its gap-free position in the raw sequence. Next, it 
transforms these positions to a string vector, like “1-2-2”, 
meaning there are 3 sequences, and this column contains the 
first base of the first sequence, the second base of the second 
sequence and also the second base in the third sequence. 
Finally, the program extracts all columns with the same position- 
vectors between two alignments (Figure 2). This manipulation is 
useful for analyses such as the identification of regions 
informative for phylogenetic inference. 


COMPARISON AND MUTATION 


Position-vectors 
12 


0. T iss 
1-5-5-5-5 Overlapping regions 
2-6-6-6-6 
eee -~-CCTGCGGCAGCAACA 
CCCCTGCGGCAGCAACA 
mamy CCCCTGCGGCAGCAACA 
terien CCCCTGCGGCAGCAACA 
a tree CCCCTGCGGCAGCAACA 
0=3=3=3=3 
3-7-7-7-7 
4-8-8-8-8 


The red-colored columns in the two alignments hold the same position-matrix, which are identified as the final overlapping regions by FasParser. 


EXTRACTION AND FILTRATION 

FasParser can also extract and filter a set of sequences from a 
raw FASTA file (“Fas-Filter’) based on query IDs, as well as 
removing sequences according to a cutoff-length. Fas-Filter can 
cut a raw alignment by removing columns with gaps based on a 
cutoff value of gap frequency. Moreover, the program can also 
provide summary-statistics of a raw alignment, such as pointing 


out one or more too short sequences and calculating the length 
of gap-free blocks. 


COMPARISONS BETWEEN FASPARSER WITH OTHER 
PROGRAMS 

The FasParser package provides a graphic user interface (GUI) 
with several commonly used functions that perform sequence 
manipulations. This package remains limited in that it cannot 
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perform phylogenetic inference, edit alignments and identify open 
reading frames (ORF) (Table 1). Therefore, FasParser is not a 


Table 1 Comparisons between FasParser with other programs 


replacement of other packages, such as MEGA. Nonetheless, 
new functions to FasParser are in the process of development. 





MEGA BioEdit FasParser 
GUI N J J 
Batch processing x x y 
Sequence viewing y 1 J 
Alignment comparison x x 4 
Translation y J J 
format conversion y y Ni 
Extraction & filtration x y N 
Sequence editing V y x 
Phylogeny inference N y x 
ORF identification x y x 
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